
FIG. 1 



AlaSerCysLeuAsnCysSerAlaSerllelleProAspArgGluValLeuTyrAxgOlu 
1 GGCCTCCTGCTTGAACTGCTCGGCGAGCATCATACCTGACAGGGAAGTCCTCTACCGAGA 
CCGGAGGACGAACTTGACGAGCCGCTCGTAGTATGGACTGTCCCTTCAGGAGATGGCTCT 

PheAspGluMetGluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeu 
6 1 GTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCT 
CAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGA 

AlaGluGlnPheLysGlnLysAlaLeuGlyLeu 
121 CGCCGAGCAGTTCAAGCAG AAGGCCCTCGGCCTCC 
GCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 



GlyCysValVallleValGlyArgValValLeuSerGlyLysProAlallelleProAsp 
1 CTGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTG 
GACCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGAC 

T 

ArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuProTyr 
61 ACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGT 
TGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCA 
A 

ZleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGln 
121 ACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGC 
TGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACG 

ThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeu 
181 AGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAAC 
TCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTG 

GluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGiy 
24 1 TCGAG ACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGG ATACAATACTTGGCGG 
AGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCC 

LeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAlaVal 
301 GCTTGTCAACGCTGCCTGGTAACCGCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTG 
CGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGAC 

ThrSerProLeuThrThrSerGln 
361 TCACCAGCCCACTAACCACTAGCCAAA 
AGTGGTCGGGTGATTGGTGATCGGTTT 
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AspAlaHisPhel^euSerClnThrLysGLnSerCIyGLAJAsnLeuProTyrLeuValAla 
L GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnHetTrp 
61 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysLeuIleAxgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeu 
1 2 L AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGloAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCys 
1 8 1 GGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATG ACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuVaLGLyGlyValLeuAla 
2 4 1 ATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTtGTGGACCCACGAGCAACCGCCGCAGGACCGA 

AlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValvalLeu 
30 L GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

SezrGlyLysProAlallelleProAspArgGluValLeuTyrArgGLuPheAspGluMet 
361 TCCCGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTfCGATGAGATG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTAC 

GluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
4 21 GAAGAGTGCTCTCAGCACTTACCGTACATCGAGC AAGGGATGATGCTCGCCGAGCAGTTC 
CTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAG 

LysGlnLysAlaLeuGLyLeuLeuGlnThrAlaSerArgGLnAlaGluVallLeAlaPro 
4 81 AAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCT 
TTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGA 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe 
541 GCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTC 
CGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAG 

IleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGiyAsnProAlalleAla 
601 ATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTC 

TAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGA 

Serl^uMetMaPheThrAlaAlaValThrSerProLcuThrThrSerGln 
661 TCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 7 



Overlap with 81 

PheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeu 
1 CTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATAT . 
GAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

GlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAla 
61 TGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCG 
ACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGC 

GlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeu 
121 CTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCC 
GACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGG 

AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGlu 
1 8 1 TTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTG 
AACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCAC 

valProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeu 
241 AGGTCCCCTCCACGGAGG ACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGG AGCCC 
TCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGG 

ValValGlyValvralCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAla 
301 TCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG AGGGGG 
AGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCC 

ValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
361 CAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
GTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 9A 

Se r 1 1 eG 1 uTh r 1 1 eTh rLeuP roG 1 nAspA 1 aVa 1 Se r A rgTh rG 1 n A rg A rgG 1 yA rg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

ThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
61 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGC 
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
121 ATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTC 
TAGAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAG 

Th rProA 1 aG 1 uTh rTh rVa I A rgLeuA rgA 1 aTy rMetAsnThr ProG 1 yLeuProVa 1 
181 ACGCCCGCCGAGACTACAGTTAGGCTAGGAGCGTACATGAACACCCCGGGGCTTCCCGTG 
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCAC 

n , CysG 1 nAspH i sLeuG 1 uPheTrpG 1 uG 1 yVa 1 PheTh rG I yLeuTh rHisI leAspAla 
241 TGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCC 
AGGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln 
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAA 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTT 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGT 
CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTryArgLeuGlyAla 
421 TTGATTCGCCTCAAGCCCACCCTCGATGGGCCAACACCCCTGCTATACAGACTGGGCGCT 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 

ValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysNetSer 
481 GTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCG 
CAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGC 

A 1 a AspLeuG 1 uVa 1 Va 1 ThrSerTh rTr pVa 1 LeuVa 1 G 1 yG 1 y Va 1 LeuA 1 a A 1 a Leu 
541 GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTG 
CGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAAC 

A 1 a A 1 aTy r Cy s LeuSe rTh r G 1 y Cy s Va 1 Va 1 1 1 eVa 1 G 1 yA rgVa 1 Va 1 Le uSe rG 1 y 
601 GCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGG 
CGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCC 

LysProA la 1 1 e 1 1 eProAspArgG 1 uVa 1 LeuTy rA rgG 1 uPheAspG 1 uMetG 1 uG 1 u 
661 AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAG 
TTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
721 TGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAG 
ACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTC 

LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
781 AAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTC 
TTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAG 




FIG. 9B 

n GInThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhel leSer 
841 CAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT 
GTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCA 

nn . GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGl^ 
901 GGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTG 
CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAAC 

nf , , Me t A 1 aPheThrA 1 aA 1 aVa ITh rSe rProLeuTh rTh rSerG 1 nTh rLeuLeuPheAsn 
961 ATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAAC 
TACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGACAAGTTG 

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
1021 ATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTG 
TATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACAC 

_ GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
1081 GGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGAC 
CCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTG 

I leLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysI leMetSer 
1141 ATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGC 
TAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCG 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGly 
1201 GGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGA 
CCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCT 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
1261 GCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAG 
CGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTC 

™, GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
1321 GGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 1 1 

Overlap with 32 

MetAsnA rgLeu I leA 1 aPheA 1 aSerArgG 1 yAsnH i sVal SerProThrH i sTyrVal 
1 GAT6AACC6GCT6ATA6CCTTC6CCTCCCGG6G6AACCATGTTTCCCCCACGCACTACGT 
CTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCA 

Pr oG 1 uSe rAspA 1 aA 1 a A 1 a A rgVa 1 Th r A 1 a 1 1 eLeuSe rSe r LeuTh rVa IThrGln 
61 GCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCA 
CGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGT 

LeuLeuArgArgLeuH i sG InTrpI leSerSerG 1 uCysThrThrProCysSe rG 1 ySer 
121 GCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTC 
CGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAG 

TrpLeuArgAspI leTrpAspTrpI leCysGluValLeuSerAspPheLysThrTrpLeu 
181 CTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCT 
GACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGA 

LysAlaLysLeuMetProGlnLeuProGlyl.leProPheValSerCysGlnArgGlyTyr 
211 AAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTA 
TTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCAT 

LysGlyValTrpArgVal 
301 TAAGGGGGTCTGGCGAGTG 
ATTCCCCCAGACCGCTCAC 
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FIG. 13 

CysSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSerSerGluCys 
1 ACTGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGT 
TGACGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCA 

ThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeu 
61 GTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGT 
CATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCGTGACCTATACGCTCCACA# 

Overlap with 33b 

SerAspPheLysThrTrpLeaLysAlaLysLeuMetProGlnLeuProGlylleProPhe 
1 2 1 TGAGCGACTTTAAG ACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCT 
ACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGA 



ValSerCysGlnArgGlyTyrLysGlyValTrpArgGlyAspGlylleMetHlsThrArg 
1 8 1 TTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCG AGGGG ACGGCATCATGC ACACTC 
AACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCCCCTGCCGTAGTACGTGTGAG 

CysHisCysGlyAlaGluileThrGlyHisValLysAsnGlyThrMetArglleValGly 

2 4 1 GCTGCCACTGTGGAGCTGAG ATC ACTGG AC ATGTGAAAAACGGG ACG ATGAGGATCGTCG 
CGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGC 

ProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGly 
301 GTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGG 
CAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCC 

ProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGlu 
361 GCCCCTQTACCCCCCTTCCTGCGCCG AACTACACGTTCGCGCTATGGAGGGTGTCTGC AG 
CGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTC 

GluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyKetThrThrAsp 
4 21 AGGMTATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTG 
TCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGAC 

AsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
4 8 1 AC AATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTC^AGAAT 
TGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 14A 

AlaTyrMetSerLysAlaHisGlyl leAspProAsril leArgthrGiyValArgThrl le 
1 TGCTTACAT6TCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAAT 
ACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
61 TACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC . 

SerGlyGlyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

IleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaTh rA 1 aTh r ProProG 1 ySe rVa ITh rVa 1 ProH i sProAsn I leGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

A 1 aLeuSe rTh rThrGlyGluIl eProPheTy rG lyLysA 1 a 1 1 eProLeuG 1 uVa 1 1 1 e 
301 TGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 

LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 

IleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThrGlyTyrThr 
181 CATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATAC 
GTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATG 

G 1 yAspPheAspSe rVa 1 1 1 eAspTy r AsnTh rCysVa IThrG 1 nThrVa 1 AspPheSer 
541 CGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTCGATTTCAG 
GCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAGCTAAAGTC 

LeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
601 CCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCAC 
GGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTG 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
661 TCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGG 
AGTTGCAGCCCCCTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCC 

GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
721 GGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTG 
CCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGAC 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThr 
781 TGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACAC 
ACGAACGATACTCGAGTGCGGGCGGeTCTGATGTCAATCCGATGCTCGCATGTACTTGTG 

ProG lyLeuProVa 1 CysG 1 nAspH I sLeuG 1 uPheTrpG 1 uG 1 y Va 1 PheThrG 1 y Leu 
811 CCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCT 
GGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGA 



FIG. 14B 

ThrHisI 1 eAspA laHi sPheLeuSe rG 1 nTh r LysG InSerGlyG 1 uAspLeuProTy r 
901 CACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTA 
GTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT 

LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
961 CCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGA 
GGACCATCGCATGGTTCGGTGGGACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCT 

GInMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeu 
1021 CCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCT 
GGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGA 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
1081 ATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACAT 
TATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTA 

Me tTh r CysMe tSe r A 1 a AspLeuG 1 uVa 1 Va 1 Th r Se rTh rTrpVa 1 LeuVa lGlyGly 
1141 CATGACATGCATGTC6GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGG 
GTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArg 
1201 CGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAG 
GCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTC 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
1261 GGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTT 
CCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAA 

AspG 1 uMetG 1 uG 1 uCysSe rG 1 nH i sLeuProTy r I leGluGlnGl yMetMetLeuA la 
1321 CGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGC 
GCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCG 

GluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal 
1381 CGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGT 
GCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCA 

IleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMet 
im TATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATAT 
ATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATA 

TrpAsnPhel leSerGlyl leGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
1501 GTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCC 
CACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
1561 CGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCA 
GCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
1621 AACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGC 
TTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACG 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
1681 CGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGG 
GCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCC 



FIG. 14C 

, ^LysyalLeuIleAspIleLeuAlaGlyTyrGiyAlaGlyValAlaGlyAlaL^^^ 
1741 GAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGC 
CTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCG 

PheLys 1 1 eMetSerG 1 yG 1 uVa 1 ProSerTh rG 1 uAspLeuVa 1 AsnLeuLeuProA 1 a 
1801 ATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGC 
TAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCG 

1 1 eLeuSe rProG 1 yA 1 aLeuVa 1 Va 1 G 1 yVa 1 Va 1 Cys A 1 aA 1 a 1 1 eLeuA rgA rgH i s 
1861 CATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCA 
GTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGT 

ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 
1921 CGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCG 
GCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGC 

GlyAsnHisValSerProThrHisTyrYalProGluSerAspAlaAlaAlaArgValThr 
1981 GGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCAC 
CCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTG 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
2041 TGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAG 
ACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTC 

SerGluCysThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCys 
2101 CTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATG * 
GAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATAC 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
2161 CGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGG 
GCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACG6TGTCGACGGACC 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
2221 GATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCAT 
CTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTA 

HisThrArgCysHiseysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg 
2281 GCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAG 
CGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTC 

IleValGlyProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyr 
2341 GATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTA 
CTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGAT 

ThrThrG 1 yProCysTh rProLeuProA 1 aProAsnTy rThrPheA 1 aLeuTrpA rgVa 1 
2401 CACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGT 
GTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCA 

Se r A 1 aG 1 uG 1 uTy rVa 1 G 1 u 1 1 e A rgG I nVa 1 G 1 yA spPheH 1 sTy rVa 1 Th rG 1 yMe t 
2461 GTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTAT 
CAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATA 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
2521 GACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
CTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 15 

AlaVa 1 AspPhe I leProVa 1 G 1 uAsnLeuG 1 uTh rTh rMetArgSe rProVa 1 PheTh r 
1 GGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCAC 
CCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTG 

AspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAlaPro 
61 GGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCC 
CCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGG 

ThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVal 
121 CACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGT 
GTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCA 



LeuVa 1 LeuAsnProSe rVa 1 A I aA laTh rLeuG 1 yPheG lyA 1 aTy rMetSerLysA la 
181 GCTAGTACTCAACCCCTCT6TTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGC 
CGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCG 

Overlap with 40b 

HisGlyl leAspProAsnl leArgThrGlyValArgThrl leThrThrGlySerProI le 
241 TCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCAT 
AGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTA 

Th rTy r Se rTh rTy rG 1 y Ly sPheLeuA 1 aAspG 1 yG 1 yCy sSe rG 1 yG 1 yA 1 aTy rAsp 
301 CACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGA 
GTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACT 

IlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThr 
361 CATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATTGGCAC 
GTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACCGTG 

Va 1 LeuAspG 1 n A 1 aG luTh rA 1 aG 1 yA 1 aA rgLeuVa 1 Va 1 LeuA 1 aTh r A 1 aTh r Pro 
421 TGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCC 
ACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGG 

ProG 1 ySe rVa 1 Th rVa 1 ProH i sProAsn I leG 1 uG 1 uVa 1 A 1 aLeuSerTh rTh rG 1 y 
481 TCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGG 
AGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCC 

GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
541 AGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCT 
TCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGA 

I lePheCysHi sSerLysLysLysCysAspG 1 uLeuA laA laLysLeuVa 1 A laLeuG ly 
601 CATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGG 
GTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCC 

1 1 eAsnA 1 aVa 1 A 1 aTy rTy r A rgG 1 yLeuAspVa 1 Se rVa 1 1 1 eProTh rSe rG 1 yAsp 
661 CATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGA 
GTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCT 

ValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSerVal 
721 TGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGT 
ACAACAGCAGCACCGTTGGCTAC6GGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCA 

IleAspCysAsnThrCys 
781 GATAGACTGCAATACGTGTG 
CTATCTGACGTTATGCACAC 



FIG. 16 

ProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAspValllePro 
1 CTCCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATTC 
GAGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAG 

ValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeu 
61 CCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTACT 
GGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATGA 

LysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyllePheArg 
121 TGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTTA 
ACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAAT 

Overlap with 

AlaAlaValCysThrArgGlyValAlaLysAlaValAspPhelleProValGluAsnLeu 
181 GGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACC 
CCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGG 

33c _ - 

G 1 uTh rTh rMetA rgSe rProVa 1 PheThr AspAsnSer 
2m TAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTC 
ATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAG 



FIG. 17 



GlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGly 
1 GGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGG 
CCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCC 



AspLysAsnGlnvalGluGlyGluvalGlnlle 
GACAAAAACCAAGTGGAGGGTG 



CysIlelleThrSerLeuThrGlyAr^ ^i^***^.^ 

61 GTGCATAATCACCAGCCTAACTGGCCG6GACAAAAACCAAGTGGAGGGTGAGGTCCAGAT 
CACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTA 



ValSerThrAlaAlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrpThrVal 
121 TGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGT 
ACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACA 

TyrHisGlyAlaGlyThrArgThrlleAlaSerProLysGlyProVallleGlnMetTyr 
181 CTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTA 
GATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACAT 

ThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeuThr 
241 TACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGAC 
ATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTG 

Overlap with 8h 

ProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHis 
301 ACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
TGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGC 



FIG. 18 



AsnMetTrpSer61yThrPheProIleAsnAlaTyrThrThrGlyProCysThrProLeu 
1 GAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCT6TACCCCCCT 
CTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGA 

Overlap with 25c 

ProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIle 
61 TCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATACGTGGAGAT 
AGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATGCACCTCTA 

ArgG InVa 1 GlyAspPheH i sTy rva lTfirG 1 yMetThrThrAspAsnLeuLysCysPro 
121 AAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTTAAATGCCC 
TTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAATTTACGGG 

CysGlnValProSerProGluPhePheThrGluLeuAspGlyValArgLeuHisArgPhe 
181 GTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTT 
CACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAA 

A 1 aProProCysLysProLeuLeuA rgG 1 uG 1 uVa lSe rPheA rgva 1 G 1 yLeuHi sG 1 u 
241 TGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGA 
ACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCT 

™, TyrProValGlySerGlnLeuProCysGluProGluProAspValAlaValLeuThrSer 
301 ATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTC 
TATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAG 

„ MetLeuThrAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGly 
361 CATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAG6GG 
GTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCC 

in SerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAla 
421 ATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGC 
TAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCG 

ThrCysThrAlaAsnHisAspSerProAsp 
481 AACTTGCACCGCTAACCATGACTCCCCTGAT 
TTGAACGTGGCGATTGGTACTGAGGGGACTA 



FIG. 19 

-Overlap with 14c 

SerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThrCysThrAlaAspHis 
1 A6CTCCTCGGCTAGCCA6CTATCCGCTCCATCTCTCAA6GCAACTTGCACC6CTAACCAT 
TCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTA 



AspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlu 
61 GACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGC 
CTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCG 

AsnlleThrArgValGluSerGluAsnLysValVallleLeuAspSerPheAspProLeu 
121 AACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTT 
TTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAA 

ValAlaGluGluAspGluArgGluI leSerValProAlaGluI leLeuArgLysSerArg 
181 GTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGG 
CACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCC 

ArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProProLeuValGlu 
211 AGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAG 
TCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTC 

ThrTrpLysLysProAspTyrGluProProValValHisGlyCysProLeuProProPro 
301 ACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCA 
TGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGT 

LysSerProProvalPro 
361 AAGTCCCCTCCTGTGCCG 
TTCAGGGGAGGACACGGC 



FIG. 20 



ValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyr 
1 CGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAACCCGACTA 
GCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTTGGGCTGAT 

Overlap with 8f 

GluProProValValHisGlyCysProLeuProProProLysSerProProValProPro 
61 CGAACCACCTGTGGTCCATGGCTGCCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCC 
GCTTGGTGGACACCAGGTACCGACGGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGG 

ProA rgLy sLys A rgTh rVa 1 Va 1 LeuTh rG 1 uSe rTh r LeuSe rTh r A 1 aLeuA 1 aG 1 u 
121 GCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGA 
CGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCT 

LeuAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThr 
181 GCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGAC 
CGAGCGGTGGTCTTCGAAACCGTCGAGGAGTT6AAGGCCGTAATGCCCGCTGTTATGCTG 

Th rSerSerG 1 uP roA 1 aProSerG lyCysProProAspSe rAspA 1 aG 1 uSerPhe 
211 AACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTTTGC 
TTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGAAACG 



FIG. 21 



A 1 aSe rA rgSerPheG 1 ySerSerSerThrSerGl y I leThrG 1 yAspAsnTh rThrTh r 
1 GCCTCCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACA 
CGGAGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGT 

Overlap with 33f 

SerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSer 
61 TCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCC 
AGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGG 

MetProProLeuG 1 uG 1 yG 1 uProG 1 yAspProAspLeuSe r AspG 1 ySerTrpSerThr 
121 ATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACG 
TACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGC 

ValSerSerGluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThr 
181 GTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACA 
CAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGT 

GlyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSer 
241 GGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGC 
CCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCG 

AsnSerLeuLeuArgHisHisAsnLeuvalTyrSerThrThrSerArgSer 
301 AACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
TTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 



FIG. 22 

GlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArg 
1 GGCACCTATGTTTATAACCATCTGACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGA 
CCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCT 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
61 GATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACG 
CTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGC 

_ TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
121 TGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGC 
ACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCG 



A rgG lyArgGluI 1 eLeuLeuG 1 yProA 1 aAspG 1 yMetVa 1 Se rLysG 1 yTrpA rgLeu 
181 AGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGTTGGAGGTTG 
TCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCAACCTCCAAC 

LeuAlaProIleTtarAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIlelLeThr 
241 CTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACC 
GACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGG 

Overlap with 7e 

Se r LeuTh rG 1 yA rgAspLysAsnG 1 nVa 1 G 1 uG 1 yG 1 uVa 1 G 1 n 1 1 eVa 1 Se rTh r A 1 a 
301 AGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCT 
TCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGA 

AlaGl nTh r PheLeuA 1 aTh r Cy s 1 1 e AsnG 1 y Va 1 Cy sTrp 
361 GCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGG 
CGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACC 



FIG. 23 

GlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyr 
1 GGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTAT 
CCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATA 

HeSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHis 
6 1 ATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGG AAGCGCAACTGCAC 
TAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTG 

valTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLouLeuMetCys 
121 GTGTGG ATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGT 
CACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACA 

AlaValHisProThrLeuValPheAspIleThrLysLeuLeuLeuAlaValPheGlyPro 
181 CCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCC 
CGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGG 

LeuTrpIleLeuGlnAlaSerLeuLeuLysValProTyrPheValArgValGlnGlyLeu 
241 CTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTT 
GAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAA 

.euArgPheCysAlaLeuAlaArgLysMetllcGlyGlyHisTyrValGlnMetVallle 
3 0 1 CTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATC 
GAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAG 



IleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAsp 

3 6 1 ATTAAGTTAGGGGCGCTTAGTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGG AC 

TAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTG 

Overlap with It 

TrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPheSerGln 

4 21 TGGGCGCACAACGGCTTGCGAG ATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAA 

ACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTT 



MetGluThrLy sLeu I leThrTrpGly 
4 81 ATGGAG ACCAAGCTCATCACGTGGGGGGC 
TACCTpTGGTTCGAGTAGTGCACCCCCCG 




FIG. 24 

6 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA 1 aAspA 1 aA rgVa 1 Cy sSe rCy sLeuTrp 
1 66GA6TAC6TC6TTCTCCTGTTCCnCT6Cn6CA6ACGC6CGCGTCTGCTCCT6CTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuI leSerGlnAlaGluAlaAlaLeuGluAsnLeuVal I leLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

i2i zMmmmlml&mlmmmlm ,, 

GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAG6AGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTG6CCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

oii T ^LeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeuAspThrGluVal^ 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

Overlap with lib ■ 

™ , .^Se r Cy sG 1 yG 1 y Va 1 Va 1 LeuVa 1 G 1 y LeuMe t A laLeuTh rLeuSe r ProTy rTy r Lys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrl leSerTrpCysLeuTrpTrpLeuGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGAA 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCTT 

FIG. 25 

, ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
1 CCAGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCC . 
GGTCGGGGAAGACCGACGGGGGG6CTGAGGCTGCGACTCAGGATAA6GAGGTACGGGGGG 

„, LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 
61 CTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACAGTCAGTAGT 
GACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGTCAGTCATCA 

----- Overlap with 33g 

GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeu 
121 GAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCCTACTCTTGGACAGGCGCACTC 
CTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGGATGAGAACCTGTCCGCGTGAG 

101 ValThrProCysAlaAlaGluGluGlnL^^ 

181 GTCACCCCGTGCGCCGC6GAAGAACAGAAACTGCCCATCAATGCACTGAGCAACTCGTTG 
CAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGACTCGTTGAGCAAC 

o , , LeuArgHisH 1 s AsnLeuVa 1 Ty rSe rThrThrSer A rgSerA 1 aCysG I nA rgG 1 nLys 
2m CTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAG 
GATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTC 

„„, LysValThrPheAspArgLeuG 1 nVa 1 LeuAspSerH 1 sTy rG 1 n A spVa 1 LeuLysG 1 y 
301 AAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAG 
TTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTC 

_ , Va 1 LysA 1 a A 1 aA 1 aSer LysVa 1 LysA 1 aAsnPhe 
361 GTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTC 
CAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAG 



FIG. 26A 



6 1 uTy rVa IVa 1 LeuLeuPheLeuLeuleuA 1 aAspA 1 aA rgVa 1 Cy sSe rCy sLeuTrp 
1 GGGAGTAC6TCGTTCTCCT6TTCCTTCTGCTTGCA6ACGCGCGeGTCT6CTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeu 1 1 eSerG InAlaGI uAlaA laLeuG 1 irAsnLeuVa 1 1 1 eLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGAT6AGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

A laSerLeuAl aG lyThrH i sGlyLeuVa lSerPheLeuVal PhePheCysPheAl aTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAA6GGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

LeuLeuLeuLeuLeuA 1 aLeuP roG 1 nArgA 1 aTy r A 1 aLeuAspTh r G 1 uVa 1 A 1 a A 1 a 
211 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

Se r CysG 1 yG 1 y Va 1 Va 1 Leuva 1 G 1 y LeuMe tA 1 aLeiiTh rLeuSe rProTy rTy r Ly s 
301 CGTCGTGTGGCGGTGTT6TTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGIn 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGC 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCG 

LeuHisValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeu 
421 AACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTAC 
TTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATG 

MetCysA 1 aVa 1 H i sProThrLeuVa 1 PheAspI 1 eThrLysLeuLeuLeuA 1 aVa I Phe 
481 TCATGTGTGCTGTACACCCGACTCTG6TATTTGACATCACCAAATTGCTGCTGGCCGTCT 
AGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGA 

G 1 yProLeuTrp IleLeuGlnAl aSerLeuLeuLysVa 1 ProTy rPheVa 1 A rgValG 1 n 
541 TCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCC 
AGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG6GATGAAACAC6CGCAGG 

GlyLeuLeuArgPheCysAlaLeuAlaArgLysMetlleGlyGlyHisTyrValGlnMet 
601 AAGGCCTTCTCCGbTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAA 
TTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTT 

Va 1 1 le 1 1 eLysLeuG lyA laLeuThrG lyTh rTy rVa lTy rAsnH 1 sLeuThrProLeu 
661 TGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTC 
ACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAG 

ArgAspTrpAlaH i sAsnG lyLeuArgAspLeuA 1 aVa 1 A 1 aVa 1 G 1 uProVa 1 Va 1 Phe 
721 TTCGGGACT6GGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCT 
AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC6ACATCTCGGTCAGCAGA 

SerGlnMetGluThrLysLeuIleThrTrpGlyAlaAspThrAlaAlaCysGlyAspIle 
781 TCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGGTGACA 
AGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTGT 

IleAsnGl yLeuProVa 1 Se r A 1 aArgA rgG lyArgGluIl eLeuLeuG 1 y Pro A 1 aAsp 
841 TCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAG^ 
AGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGC 

G 1 yMe tVa 1 Se r LysG 1 yTr pA rgLeuLeuA 1 aPro 1 1 eTh r A 1 aTy rA 1 aG 1 nG 1 nTh r 
901 ATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGA 
TACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCT 



FIG. 26B 

Arg61yLeuLeu61yCysIleIleThrSerLeuThrGlyArgAspLysAsnGlnyalGlu 
961 CAAG6GGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGG 
GTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATT6ACCGGCCCTGTTTTTGGTTCACC 

G 1 yG 1 uVa 1 G 1 n 1 1 eVa lSerThrA 1 a A 1 aG 1 nThrPheLeuA 1 aThrCys IleAsnGly 
1021 AGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATG 
TCCCACTCCAGGTCTAACACA6TTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTAC 

Va 1 CysTr pThrVa 1 Ty rH i sG 1 yA 1 aG 1 yThrA rgTh rl leAl aSer ProLysG 1 y Pro 
1081 GGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTC 
CCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAG 

VallleGlnMetTyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGly 
1141 CTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAG 
GACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTC 

SerArgSerLeuThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHis 
1201 GTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGC 
CATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCG 

A 1 a AspVa 1 1 1 eP roVa 1 A rgArgA rgG 1 yAspSer A rgG 1 ySer LeuLeuSe rProA rg 
1261 ACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCC 
TGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGG 

Pro 1 1 eSe rTy r LeuLysG 1 ySe r SerG 1 yG 1 y ProLeuLeuCysProA laGlyHlsAla 
1321 GGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACG 
CCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGC 

Va 1 G 1 y 1 1 ePhe A rgA 1 a A 1 aVa 1 Cy sTh r A r gG 1 y Va 1 A 1 aLy s A 1 a Va 1 AspPhe 1 1 e 
1381 CCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTA 
GGCACCCGTATAAATCCCGGCGCeACACGTGGGCACCTCACCGATTCCGCCACCTGAAAT 

ProValGluAsnLeuGluThrThrMetArgSerProValPheThrAspAsnSerSerPro 
mi TCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTC 
AGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAG 

ProVa 1 Va 1 ProG 1 nSe r PheG 1 nVa 1 A 1 aH i sLeuH i sA 1 aProThrG 1 ySe rG 1 yLys 
1501 CACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCA 
GTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGT 

SerTh rLysVa 1 ProA 1 aA 1 aTy r A 1 a A 1 aG 1 nG 1 yTy rLysVa 1 LeuVa 1 LeuAsnPro 
1561 AAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAAC^ 
TTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGG 

SerValAlaAlaThrLeuGlyPheGlyAlaTyrHetSerLysAlaHisGlylleAspPro 
1621 CCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGATCGATC 
GGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAG 

Asn 1 1 e A rgTh rG 1 y Va 1 A rgTh r 1 1 eTh rTh rG 1 ySe r Pro 1 1 eTh rTy r Se rTh rTy r 
1681 CTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCT 
GATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGGTGGA 

GlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspnellelleCysAsp 
1711 ACGGCAAGnCCnGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAATTTGTG 
TGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAAACAC 

GluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThrValLeuAspGlnAla 
1801 ACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGACCAAG 
TGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTGGTTC 

G luTh r A 1 aG 1 yA 1 aA rgLeuVa 1 Va lLeuAl aTh r A 1 aTh rProP roG 1 ySerVa 1 Th r 
1861 CAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCCGTCA 
GTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGT 

Va 1 ProH i sProAsn 1 1 eG 1 uG 1 uVa 1 A 1 aLeuSe rTh rTh rG 1 yG JuIleProPheTyr 
1921 CTGTGCCCCATCCCAACATCGAGGAGGnGCTCTGTCCACCACCGGAGAGATa 

GACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGAAAAA 



FIG. 26C 

l»lttfjn 

T6CC6TTCC6ATAGGGGGAGCnCAnA6TTGGGCCCCTeTGTA6A6TAGAAGACAGTAA 

orw , ^AY§lr)f^^ sAs P GlL,Le uAlaAlaLysLeuValAlaLeuGlyIleAsnAlaValAla 
2041 CA A AGA AGA AGTGCGACGAACTM 

GTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACC 
,, m TyrTyrArgGlyLeuAspVa 1 Se rVa 1 1 1 eProTh rSe rG 1 y A spVa 1 Va 1 Va 1 Va 1 A 1 a 
GGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACC 

2161 CAACCGATGCCCTCATGACCGG^ 

GTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTAT 

^ ^^^?y§JiTD^ ! nThrVa 1 AspPheSe rLeuAspProTh rPheTh rl leGluThrl leThr 

2221 mimmimmjmmmjmi^ 

GCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCG 

GTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTAGAAGCTGAGCAGGC 

2.01 TttTCTGTGAGT^ 

AGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGG^ 

9^1 rflrTT A r£r?A&^M^ 1 "AspH isLetlG 1 U 

2.61 CAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCATfTTf; 

gtcaatccgatgctcgcatgtacttgtggggccccgaagggcacaS 

. PhfTrpGluGlyValPheThrGlyLeuThrHisIleAspAlaHisPheLeuSerGlnThr 
2521 AATTTTGGGAGGGCGTCTTTACAGG 

TTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC6GGTGAAAGATAGGGTCT 
, ~. (rV sG 1 n Se rG 1 yG 1 uAsnLeuProTy r LeuVa 1 A 1 aTy rG 1 nA 1 aTh rVa 1 CvsA 1 aAra 

2H1 ^I!H™M 

CCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCG 

97m rrlrf r?HPll^X^TJ!rK2l:?yfc§yI^i: 9LeuG 1 VA 1 aVa 1 G 1 nAsnG 1 u 1 1 eThr 
2701 CCACCCTCCATGGGCCAACACa 

GGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTTTAGT 
07ci rrJ=?^IP^^^9y?jiT^lry?Iy rIIeMetT hrCysMetSerAlaAspLeuGluValVal 

AGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACA 
GTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATG 



FIG. 26D 

, nm TyrlleGluGlnGlyMetM^^ 
3001 CGTACATCGAGCAAGGGATGATGCTC^ 

GCATGTAGCTCGTTCCCTACTACGAGCGGCTGGTCAAGTTCGTCTTCCGGGAGCCGGAGG 

Tnc i T J! pThrAlaSerArgg 1 nA 1 aG 1 uVa 1 1 1 eA 1 aProA 1 aVa 1 G 1 nTh r AsnT rpG 1 nLys 
3061 TGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACC 

ACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTT 

„ ^ , . LeuG 1 uTh rPheTr pA 1 aLy sH 1 sMe tTrp AsnPhe 1 1 eSe rGlyl leGl nTy rLeuA 1 a 
3121 AACTCGAGACCTTCTGGGCGAAG^ 

TTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACC 

^Gl^Uy^CTDt^uProS^AsnPr^JaneAlaSerLeuMetAlaPheThrAlaAla 
3181 CGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTG 
GCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGAC 

i r T y?JiT^§^^9k?yT h C ThrSerG 1 nTh rLeuLeuPheAsn I leLeuGlyGl yTrpVa 1 

ACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGAC 

CGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCATAC 

^,o, r. Jla§IyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGluValProSerThr 
3121 GCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCa 
CGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGGAGGT 

7(l01 G 1 uAspLeuVa 1 AsnLeuLeuProA 1 a 1 1 eLeuSe rProG 1 yA 1 aLeuVa 1 Va 1 G 1 yVa 1 
3181 CGGAGGACCTGGTCAATCTACTGCCCGCC 

GCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGC 

, ValCysAlaAlal leLeuArgArgHisValGlyProGlyGluGIyAlaValGlnTrpMet 
3511 JGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGA 
ACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTCACCT 

7cm Tr AsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyrValPro 
3601 TGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGC 
ACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACG 

7CC1 ^GluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValTh 
3661 CGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCAGC 
GCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGTCG 

„„ . L?y A £9 A rgLeuH isGlnTrpIleSe rSe rG l uCysTh rTh r ProCy sSe rG l ySe rTrp 
3721 TCCTGAG6CGACTGCACCAGT6MTAA6CTM 

AGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGA 

„ 0 . LeuArgAspIleTrpAs^^ 

3781 GGCTAAGGGACATCTGGGACT6GATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCTAA 
CCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGATT 

™, , . AlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGlyTyrLys 
3811 AAGgAAGCTCATGCCACAGCTGCCTGGGATCCCCnTGTGTCCTkcAGCGCGGGTATA 
TTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCATAT 

. ^GlyValTrpArgVa 1 AspG 1 y 1 1 eMetH 1 sThrA rgCysH 1 sCysG 1 yA 1 aG 1 u 1 1 eThr 
3901 AGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCG 

TCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTCTAGT 



FIG. 26E ■ r% ■ * n 

SerG 1 yThrPheProI 1 eAsnA 1 aTv rTh rTh rG lyProCysThrProLeuProAlaPro 
4021 6GA6TGG6ACCTTCCCCAnAATGCCTACACCACGG6CCCg6TACCCCCCTTCCTGCGC 
CCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCGGGGGACATGGGGGGAAGGACGCG 

AsnTy rThrPheA 1 aLeuTr pArgVa 1 Ser A laG 1 UG 1 uTyrValG 1 u 1 1 eAroG 1 nVa 1 
4081 CGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAGGQ 
GCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTCC 

GlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeuLysCysProCysGlnVal 
4141 TGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGCa 
ACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACGGTCC 

ProSerProG 1 uPhePheThrG 1 uLeuAspG 1 yVa 1 A rgLeuH i sArgPheAl aProPro 
4201 TCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCGCCCC 
AGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCG6ATGTATCCAAACGCGGGG 

CysLysProLeuLeuArgG 1 uG 1 uVa 1 Se rPheArgVa 1 G I yLeuH 1 sG 1 yTy rProVa 1 
4261 CCTGCAAGCCCnGCTGCGGGAGGAGGTATCATTCAGfc^^ 

GGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATGGGCC 

GlySerG InLeuProCysG 1 uProG luProAspVa 1 A laVal LeuThrSerMetLeuThr 
4321 TAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATGCTCA 
ATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTACGAGT 

AspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGlySerPr^ 
4381 CTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATCACa 
GACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGG 

SerVa 1 A 1 aSe r Se r Se r A 1 aSe r G 1 nLeuSe r A 1 aP roSe r LeuLy s A 1 aThrCy sTh r 
4441 CCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTgCAAGGCAACTTGCA 
GGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGT 

AlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArgGlnGlu 
4501 CCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGG 
GGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCC 

MetG 1 yG I yAsn 1 1 eThr A rgVa lGluSerGl uAsnLysVa 1 Va 1 1 1 eLeuAspSerPhe 
4561 AGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGACTCCT 
TCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGA 

AspProLeuVa 1 A 1 aG 1 uG 1 u As pG 1 uA rgG 1 u 1 1 eSe rVa 1 P r o A 1 aG 1 u 1 1 eLeu A rg 
4621 TCGATCCGCTTGTGGCGGAGGAGGACGAGCGSGAGATCTCCGTACtt^ 

AGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACG 

LysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProP^ 
4681 GGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCC 
CCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGG 

LeuValGluThrTrpLysLysProAspTyrGluProProValValHisGlyCysPr^ 
4741 CGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGC 
GCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCG 

ProProProLysSerProProValProProProArgLysLysArgThrValValLeghr 
4801 TTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCA 
AAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGT 

G 1 uSe rTh r LeuSe rTh r A 1 aLeuA 1 aG 1 uLeuA 1 aTh rA rgSe rPheG 1 ySe rSe rSe r 
4861 CTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAGCTCCT 
GACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGA 

ThrSerG 1 y 1 1 eThrG 1 yAspAsnThrTh rThrSerSerG 1 uProA 1 aProSerG 1 yCys 
4921 CAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTCTGGCT 
GTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGACCGA 

ProProAspSerAspAlaGluSerTyrSerSerMetProProLeuGluGlyGluProGly 
4981 GCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCgGGAGGGGGAG^ 
CGGGGGGGCTGAG6CTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGAC 



FIG. 26F 

CCCTAGGCCTAGAATC6CT6CCCA6TACCAGTT6CCAGTCATCACTCCG6TTGC6CCTCC 

Va 1 Va 1 Cy sCy sSe rMe tSe rTy rSe rT rpTh r G 1 y A 1 aLeuVa ITh r P roCys A 1 a A 1 a 
5101 ATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCC^ 

TACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGCGGC 

G 1 uG 1 uG 1 nLy sLeuPro 1 1 e AsnAl aLeuSe rAsnSe rLeuLeuArgH i sH i s AsnLeu 
5161 CGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACC^ 

GCCTTCTT6TCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGTGTTAA 

Va 1 Ty rSe rTh rTh rSe r A rgSe r A 1 aCy sG 1 nA r gG 1 nLy sLy sVa 1 Th r Phe AspA rg 

5221 IKM^^ 

5281 «i«r^^ 

CTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCA 

LysValLysAlaAsnLeu 
5311 CAAAAGTGAAGGCTAACTTG 
GTTTTCACTTCCGATTGAAC 



61 



FIG. 30 

i aiitK 

CCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGTTCGCCGCATGACTGTTGATCGACA 

GlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAla^ 
GGTAACACCCTCACTTGTTACATCAAGGCCCGAGCAGCCTGTCGAG^ 
CCATTGTGGGAGTGAACAATGTAGTTCCGGGCTCGTCGGACAGCTCGGCGTCCCGAGGTC 

Overlap with 19g 

AspCysThrMetLeuValCysGlyAspAspLe^ 
121 GACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGG^ 
CTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAG 

G 1 nG 1 uAspAlaA 1 aSe r LeuArgA 1 aPheThrG 1 uA 1 aMetThrArgTy rSerA 1 aPro 
CAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCC^ 
GTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGG 

ProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSerCysSerSerAsn 
241 CCTGGSGACCCCCCACAACCAGAATACGACTTGGAGCTCATA^ 

GGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTG 

Va 1 SerVa 1 A 1 aH i sAspG 1 yA 1 aG lyLys ArgVa lTy rTy rLeuThrA rgAspProThr 
301 GTGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTG^ 

CACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGT 

ThrProLeuA I aAr gA 1 aA 1 aTr pG 1 uThr A 1 a A rgH i s W 
361 ACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACAC^ 

TGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTCA6TTAAGGACCGAT 

GlyAsnl lei leMetPheAlaProThrLeuTrpM 
421 GGCAACATAATCATGTTTGCCCCCACACTGTGGGCG 
CCGTTGTATTAGTACAAACGGGGGTGTGACACCCGC 
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FIG. 28 

Overlap with 39c 

LeuLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGlu 
1 TGCTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGG 
ACGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCC 

AlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAsp 
61 AAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAG 
TTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTC 

ValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeu 
1 2 1 ACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTC 
TGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAG 

GluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysVal 
181 TGG AAG ACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACG AGGTTTTCTGCG 
ACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGC 

GlnProGluLysGIyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyVal 
241 TTC AGCCTG AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCG 
AAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGC 

ArgValCysGluLysMetAlaLcuTyrAspValValThrLysLeuProLeuAlaValMet 
301 TGCGCGTGTGCGAAAAG ATGGCTTTGTACG ACGTGGTTACAAAGCTCCCCTTGGCCGTG A 
ACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACT 

GlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAla 
361 TGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAG 
ACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTC 

TrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThr 
421 CGTGGAAGTCCAAG AAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTG ACTCCA 
GCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGT 

ValThrGluSerAspIleArgThrGluGluAla 
4 81 CAGTCACTG AGAGCGACATCCGTACGGAGG AGGCA 
GTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT 



FIG. 29 



GluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThr 
1 GAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACC 
CTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGC 

Overlap with 35f 

ArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGluAlalleTyrGln 
61 CGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAA 
GCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTT 

CysCysAspLeuAspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeuTyr 
121 TGTTGTGACCTCGACCCCC AAGCCCGCGTGGCCATCAAGTCCCTCACCGAG AGGCTTTAT 
ACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATA 

valGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAla 
181 GTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCG 
CAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGC 

SerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAla 
241 AGCGGCGTACTGAC AACTAGCTGTGGTAAC ACCCTCACTTGCTACATCAAGGCCCGGGC A 
TCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGT 

AlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuVal 
301 GCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTC 
CGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAG 

VallleCysGluSerAlaGlyValGlnGluAspAlaAla 
361 G TTATCTGTG AAAGCGCGGGGG TCCAGG AGG ACGCGGCG AG 
CAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTC 
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# FIG. 32A 



, ^IlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeuGluAlaA 
1 CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCT6GAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCGTCCCCAGCTTGTGTCCGACCTTCGACGGACGT. 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT 
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

T LeuLeuThrThrThrGlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGA 

, „ , SerTh rG lyLeu 1 1 eH i sLeuH i sG 1 nAsn 1 1 eVa 1 Aspva 1 G 1 nTy rLeuTy rG 1 yVa 1 
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCC 

01 , GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeu 
2^1 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 

™, LeuA 1 aAspA 1 aArgVa 1 CysSe rCysL'euTr pMe tMetLeuLeu 1 1 eSerG 1 nA 1 aG 1 u 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

A 1 aA 1 aLeuG 1 uAsnLeuVa 1 1 1 eLeuAsnA 1 aA 1 aSerLeuA 1 aG 1 yThr H i sG 1 yLeu 
361 AGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 

Va lSerPheLeuVa 1 PhePheCysPheA laTrpTy rLeuLysG lyLysTrpVa lProGly 
421 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCG 
AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGC 

, « i o » £ \ ^yaJUyrThrPhe Ty rG lyMetTr pProLeuLeuLeuLeuLeuLeuA 1 aLeuProG 1 n 
181 GAGCGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCC 
CTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGG 

^ , , . „ A rgA 1 aTy rA 1 aLeu AspTh rG 1 uVa 1 A 1 aA 1 aSe rCy sG 1 yG 1 y Va 1 Va 1 LeuVa 1 G 1 y 
541 AGCGGGCGTACGCGCTGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCG 
TCGCCCGCATGCGCGACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGC 

^ ^^LeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrp 
601 GGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGT 
CCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCA 

LeuGlnTvrPte^ 

661 GGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCA 
CCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGT 

7 „ A ValArgGlyGlyArgAspAlaVaU 

721 ACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGG 
TGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACC 

701 T PheAspIleThrLysUuLeuLeuAlaValPheGlyP 

781 TATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCA 
ATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

LeuLeuLysValProTyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAla 
841 GTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAG 
CAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATC 
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ArgLy sMet 1 1 eG 1 yG 1 yH i sTy rVa 1 G 1 nMetVa 1 1 1 e 1 1 eLy sLeuG 1 yA 1 aLeuThr 
901 CGCGGAAGATGATCGGAGGCCATTACGTGCAAAT6GTCATCATTAAGTTAGGGGCGCTTA 
GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAAT 

G lyThrTy rVa 1 TyrAsnH i SLeuThrProLeuArgAspTrpA 1 aH i sAsnG 1 yLeuArg 
961 CTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGC 
GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACG 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCA 
CTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGT 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
1081 CGTGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCC 
GCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGG 

ArgG 1 yArgG 1 u 1 1 eLeuLeuG 1 yProA 1 aAspG 1 yMetVa 1 SerLysG 1 yTrpArgLeu 
1141 GCAGGGGCC6GGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGT 
CGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCA 

LeuAlaProI leThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysliel leThr 
1201 TGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCA 
ACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGT 

Se rLeuTh rG 1 yArgAspLy sAsnG 1 nVa 1 G 1 uG 1 yG 1 uVa 1 G 1 n 1 1 eVa 1 SerThrA 1 a 
1261 CCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTG 
GGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGAC 

A 1 aG InThrPheLeuA 1 aThrCys 1 1 eAsnG 1 yVa 1 CysTrpThrVa 1 Ty rH i sG 1 yA 1 a 
1321 CTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGG 
GACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCC 

GlyThrArgThrlleAlaSerProLysGlyProVallleGlnMetTyrThrAsnValAsp 
1381 CCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAG 
GGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATC 

GlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeuThrProCyrThrCys 
1441 ACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTT 
TGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAA 

GlySerSerAspLeuTyrLeuValThrArgHisAlaAspVal I leProValArgArgArg 
1501 GCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGC 
CGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCG 

GlyAspSerArgGlySerLeuLeuSerProArgProNeSerTyrLeuLysGlySerSer 
1 56 1 GGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCT 
CCCCACTATCGTCCCCGTCGGACGACAGCTGGGCCGGGTAAAGGATGAACTTTCCGAGGA 

GlyGl yProLeuLeuCy sProA 1 aG 1 yH i sA 1 aVa 1 G 1 y 1 1 ePheArgA 1 aA 1 aVa 1 Cys 
1621 CGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGT 
GCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACA 

ThrArgGlyValAlaLysAlaValAspPhelleProValGluAsnLeuGluThrThrMet 
1681 GCACCCGTG6AGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCA 
CGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGT 
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ArgSerProVa 1 PheThrAspAsnSe rSerProRroVa lVa 1 ProG 1 nSerPheG 1 nVa 1 
1741 TGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGG 
ACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCC 

AlaHisLeuHlsAlaProThrGlySerGlyLysSerThFLysValProAlaAlaTyrAla 
1801 TGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTCGCGGCTGCATATG 
ACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATAC 

AlaGlnG 1 yTy rLysVal LeuVa 1 LeuAsnProSerVa 1 A 1 a A 1 aTh r LeuG 1 yPheG 1 y 
1861 CAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTG 
GTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAAC 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1921 GTGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAA 
CACGAAT6TACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTT 

Th rTh rG 1 ySe r P ro 1 1 eTh rTy rSe rTh rTy rG 1 yLy sPheLeuA 1 a AspG 1 yG 1 yCy s 
1981 TTACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGT 
AATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCA 

SerGlyGlyAlaTyrAspI lei lei leCysAspGluCysHisSerThrAspAlaThrSer 
2041 GCTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACAT 
CGAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTA 

1 1 eLeuG 1 y 1 1 eG 1 yThrVa 1 LeuAspG 1 nA 1 aG 1 uThrA 1 aG 1 yA 1 a A rgLeuVa 1 Va 1 
2101 CCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTG 
GGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAAC 

LeuA 1 aThr A 1 aTh r P roP roG 1 y Se rVa 1 Th rVa 1 P roH i sP ro Asn 1 1 eG 1 uG 1 uVa 1 
2161 TGCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGG 
ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCC 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
2221 TTGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAA 
AACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATT 

LysG lyG lyArgH isLeuI 1 ePheCysHl sSerLysLysLysCysAspG luLeuA laAla 
2281 TCAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCG 
AGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGC 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
2341 CAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCG 
GTTTC6ACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGC 

IleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThrGlyTyrThr 
2401 TCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATA 
AGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATAT 

G lyAspPheAspSerVa 1 1 1 eAspCys AsnThrCysVa ITh rG 1 nTh rVa 1 AspPheSe r 
2461 CCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAGAGAGTCGATTTCA 
GGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGT 

LeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
2521 GCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCA 
CGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGT 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
2581 CTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGG 
GAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCC 

GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
2641 GGGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCT 
CCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGA 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThr 
2701 GTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACA 
CACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGT 
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ProG 1 y LeuPr oVa 1 CysG 1 n AspH i sieuG 1 uPheTr PG 1 uG'l y Va 1 PheTh rG 1 y Leu 
2761 CCCCGGG6CTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTC7TTACAGGCC 
6GGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACGGTCCCGCAGAAATGTCCGG 

ThrH 1 s I 1 eAspAlaH i sPheLeuSe rG lnthrLysG 1 nSerG 1 yG 1 uAsnLeuProTyr 
2821 TCACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAAa 

A6TGA6TATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAA 

LeuVa 1 A 1 aTy rG InA 1 aThrVa 1 CysA 1 aA rgA 1 aG 1 nA 1 aProProProSerTr pAsp 
2881 ACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGW 
TGGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCC 

GlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLe^ 
2941 ACCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACa 

TGGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACG 

Ty r A rgLeuG 1 y A 1 aVa 1 G 1 n AsnG 1 u 1 1 eTh r LeuTh r H i sPr oVa ITh rLy sTy r I le 
3001 TATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACA 
ATATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGT 

MetThrCysMetSerAlaAspLeuGluVaiyalThrSerTh^rpVaR 
3061 TCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTC^ 

AGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGC 

Va lLeuAl aA 1 aLeuA 1 aA 1 aTy rCysLeuSerThrG 1 yCysVa 1 Va 1 I leVa 1G 1 yArg 
3121 GCGTCCTGGCTGCTnGGCCGCGTATTGCCTGTCAACAG^ 

CGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGT 

ValValLeuSerGlyLysProAlallelleProAspArgGlu^ 
3181 GGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGT 
CCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCA 

3241 TCGA?GAGATGGAAGAGT^cfcTC^ 

AGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCAT6TAGCTCGTTCCCTACTACGAGC 

G 1 uG 1 nPheLysG 1 nLysA 1 aLeuG 1 yLeuLeuG 1 ^ ^ 
3301 CCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGQ 

GGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCC 

IleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLys^ 
3361 TTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAA^ 

AATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTAT 

TrpAsnPhel leSerGly I leGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAs^ 
3421 TGTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGT AACC 
ACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGG 

A 1 a 1 1 e A 1 aSe r LeuMe tA 1 aPheTh r A 1 aA 1 aVa 1 Th r Se r ProLeyTh rTh rSe r6 1 n 
3481 CCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCAC^ 

GGCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGG 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAl^ 
3541 AAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTG 
TTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCAC 

A 1 aTh r A 1 a PheVa 1 G 1 y A 1 aG 1 y Leu A 1 aG 1 y A 1 a A 1 a H eG 1 ySerValGlyLeuG 1 y 
3601 CCGCTACTGCCTTTGTGGGCGCTGGCnAGCTGGCGCCGCCATC 

GGCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACC 

Lys Va 1 Leu 1 1 e Asp 1 1 eLeu A 1 aG 1 yTyrG 1 yA 1 aG 1 Wa 1 A I aG 1 yA 1 aLeuVa 1 A 1 a 
3661 GGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCn^ 
CCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACC 

PheLys 1 1 eHetSerG 1 yG 1 uVa 1 ProSerThrG 1 uAspLeuVa 1 AsnLeuLeuProA 1 a 
3721 CATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGAC^ 

GTAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGC 
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„ IleLeuSerProGlyAlaLeuValValGlyVaiValCysAlaAial leLeuArgArgHis 
3781 CCATCCTCTCGCCCG6A6CCCTCGTAGTCGGCGTGGTCTGTGCA6CAATACTGCGCCG6C 
GGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCG 

„ ni , . ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuI leAlaPheAlaSerArg 
3811 ACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCC 
TGCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGG 

, GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 

3901 GGGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCA 
CCCCCTTG6TAGAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGT 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
3961 CTGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAA 
GACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATT 

IM , „ SerGluCysThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrplleCys 
1021 GCTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATAT 
CGAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATA 

, nn , G 1 uVa lLeuSer AspPheLysThrTrpLeuLys A 1 aLysLeuMetProG 1 nLeuProGl y 
1081 GCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTG 
CGCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGAC 

,,,, ^IleProPheValSerCysGlnArgGlvTyrLysGlyValTrpArgValAspGlylleMet 
1111 GGATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCA 
CCTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGT 

, ™, T J J sThrArgCysH 1 sCysG lyAlaGluI leThrGlyHi sVa 1 LysAsnG 1 yThrMetA rg 
1201 TGCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGA 
ACGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACT 

^HeValGlyProArgThrCysArgAsnHetTrpSerGlyThrPheProIleAsnAlaTyr 
1261 GGATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGA 

. ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
1321 ACACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGG 
TGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCC 

, ,„ , Se r A 1 aG 1 uG 1 uTy rVa 1 G 1 u 1 1 e A r gG 1 nVa 1 G 1 yAspPheH I sTy rVa 1 Th r G 1 yHe t 
1381 TGTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTA 
ACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCAT 

.... _TD£ ThrAs P AsnL euLysCysProCysGlnValProSerProGluPhePheThrGluLeu 
1111 TGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

AspG 1 y Va 1 ArgLeuH i s A rgPheA 1 aProProCy sLy sProLeuLeuA rgG 1 uG 1 uVa 1 
1501 TGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGG 
ACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCC 

1561 TAT CAT TCAG?GTAGgXcTCCA^ 1 " 

ATAGTAAGTCTCATCCTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGGTCGGGC 

. ProAspValAlaValLeuThrSerMetLeuThrAspProSerHlsIleThrAlaGluAla 
1621 AACCGGACGTGGCCGTGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGG 
TTGGCCTGCACCGGCACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCC 

,,.„, AlaGlyArgArgLeuAlaArgGlySerProProSerValAlaSerSerSerAlaSerGln 
1681 CGGCCGGGCGAAGGTTGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCC 
GCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGG 

, _ , _LeuSerAlaProSerLeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGlu 
1711 AGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTG 
TCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGAC 



FIG. 32F 

Leulle61uAlaAsnLeuLeuTrpArgGlnGluMetGly61yAsnIleThrArgVal6lu 
1801 AGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTG 
TCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCGTTGTA6TGGTCCCAAC 

SerG 1 uAsnLysValVal 1 1 eLeuAspSerPheAspProLeuVa 1 A laG luG 1 uAspG lu 
1861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACG 
TCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGC 

A rgG 1 u 1 1 eSe rVa 1 P roA 1 aG 1 u 1 1 eLeuAr gLysSer A rgA rgPhe A 1 aG 1 nA 1 aLeu 
1921 AGCG6GAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCC 
TCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGG 

ProVa lTrpAlaA rgProAspTy r AsnProProLeuVa 1 G 1 uThrTrpLysLysProAsp 
1981 TGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGC 

Ty rG 1 uProProVa lValHl sG lyCysProLeuProProProLysSerProProVa 1 Pro 
50H1 ACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGC 
TGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACG 

ProProArgLysLysArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAla 
5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGG 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACC 

GluLeuAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThr 
5161 CCGAGCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATA 
GGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTAT 

ThrThrSerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerTyr 
5221 CGACAACATCCTCTGAGCCCGCCCCTTCTG6CTGCCCCCCCGACTCCGACGCTGAGTCCT 
GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGA 

SerSerMetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 
5281 ATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCAT 
TAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

SerThrValSerSerGluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSer 
5311 GGTCAACGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACT 
CCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGA 

TrpThrG 1 yA 1 aLeuVa 1 Th rProCysA 1 aA 1 aG 1 uG 1 uG 1 nLysLeuPro 1 1 eAsnA 1 a 
5101 CTTGGACAGGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATG 
GAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTAC 

LeuSerAsnSerLeuLeuArgHisHisAsnLeuValTyrSerThrThrSerArgSerAla 
5161 CACTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 



CysGlnArgGlnLysLysValThrPheAspArgLeuGlnValLeuAspSerHisTyrGln 
CTTGCCAAAGGCAGAAGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACC 
GAACGGTTTCCGTCTTCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGG 



AspVa 1 LeuLy sG 1 uVa 1 LysA 1 a A 1 aA 1 aSerLysVa lLysAl aAsnLeuLeuSe rVa 1 
5581 AG6ACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCG 
TCCT6CATGAGTTCGTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAC6ATAGGC 

GluGluAlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAla 
5611 TAGAGGAAGCTTbCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGeG 
ATCTCCTTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCC 

LysAspValArgCysHlsAlaArgLysAIaValThrHisIleAsnSerValTrpLysAsp 
5701 CAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG 
GTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTC 

LeuLeuGluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPhe 
5761 ACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTT 
TGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAA 



5881 



- - -«u-r* i .Tw,o ft rt)mC 1 \t(l 1 rkA rrjV a J g J uPheLeUVa 1 



LeuThr AsnSer ArgG 1 yG 1 uAsnCysG 1 yTy rArgA rgCysA rgA 1 aSerG 1 yVa 1 Leu 
Risi rTPTTArrAATTrAAGGGGGGAGAACTGCGGC 
6181 GAGAATGGTTAAGTTC^ 

ThrThrSe rCysG WAsnThr LeuThrCysTy r I leLysA 1 aA rgA 1 aA laCys ArgA 1 a 
6241 TGACAACTAGCTeTGGTAACAC^ 

ACTGnGATCGACACCATTGTGGGAGTGAACGATG 

CvsSerSer AsnVa lSerVa 1 A 1 aH i sAspG lyAlaGl yLysA rgVa lTy rTy rLeuThr 
RZiRl rATGCTCCTCCAACGTGTCAGTCGCCCACGACGG 
64 GTACGAGGAGGHGCACAGTC^ 

Leu 

6781 GACTC 
CTGAG 
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Sample 

Chimp date (days) ALT (alanine) 

Lane Reference Infection (O=inoculation aminotransferase 



Number 


Number 


Type 


day) 


level 


in sera 


1 


1 


hi a kin 

NANB 


0 




0 


2 


1 


NANB 


76 




71 


3 


1 


NANB 


118 




19 




1 


m a kin 

NANB 


154 




til t A 

N/A 


5 


2 


NANB 


0 




0 


6 


2 


NANB 


21 




52 


7 


2 


NANB 


73 




13 


8 


2 


NANB 


138 




N/A 


9 


3 


NANB 


0 




8 


10 


3 


NANB 


43 




205 


11 


3 


NANB 


53 




14 


12 


3 


NANB 


159 




6 


13 




NANB 


-3 




11 


14 




NANB 


55 




132 


15 




NANB 


83 




N/A 


16 




NANB 


1 ttf\ 

1h0 




hi l a 

N/A 


17 


5 


1 1 A 1 / 

HAV 


0 




4 


18 


5 


HAV 


25 




147 


19 


5 


HAV 


40 




18 


20 


5 


1 1 A % 1 

HAV 


Off) 

268 




1- 

5 


21 


6 


HAV 


-8 




N/A 


22 


6 


HAV 


15 




100 


23 


6 


1 1 All 

HAV 


tt 1 
41 




10 


24 


6 


MAW 

HAV 


129 




M / A 

N/A 


26 


7 


HAV 


0 




7 


27 


7 


HAV 


22 




83 


28 


7 


HAV 


1 1 r 

115 




5 


29 


7 


HAV 


139 




hi / a 

N/A 


7n 
3U 


o 
o 


U A\/ 

HAV 


u 




ID 


31 


8 


HAV 


26 




130 


32 


8 


HAV 


74 




8 


33 


8 


HAV 


205 




5 


34 


9 


HBV 


-290 




N/A 


35 


9 


HBV 


379 




9 


36 


9 


HBV 


435 




6 


37 


10 


HBV 


0 




8 


38 


10 


HBV 


111- 


■118 (pool) 


96- 


39 


10 


HBV 


205 


9 


40 


10 


HBV 


240 




13 


41 


11 


HBV 


0 




11 


42 


11 


HBV 


28- 


•56 (pool) 


8 


43 


11 


HBV 


169 




9 


44 


11 


HBV 


223 




10 



FIG. 33A 




FIG. 33B 



CHIMPS 




HAV HAV HBV HBV HBV 
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Patient 






Lane 


Reference 




AT T T a wa 1 / *i 

ali Lievei vi 


Number 


Number 


Diagnosis 


1 


1 1 

\l 


VTA MTi 




2 




XT A XTTJ 


ii 
j i 


3 


}l 


XT AMU 


1 L 


4 


9l 


VTA Mil 


70 


5 


A 


NANB 


26 


6 


3 


NANB 


78 


7 


ll 


vj a MB. 


ft7 


8 


*i 


XT A KTU 




9 




NANB 


60 


10 




NANB 


13 


11 




« A XTU 




12 




XT A XTO 


1UI 


13 


a 


NANB 


A 7 /■ 
4/4 


14 


Si 


XT A XTD 

NANB 


11 ft 

JIB 


15 


l\ 


NANB 


on 


16 


l\ 


NANB 


ICQ 

lb j 


17 


Si 


NANB 


A A 
44 


18 


8 1 


XT A VTTJ 
NANB 


DU 


19 


9 


NANB 


XT / A 

N/A 


20 


10 


NANB 


XT / A 

N/A 


21 


11 


NANB 


XT / A 

N/A 


22 


12 


Normal 


M / A 

N/A 


23 


13 


Normal 


XT / A 

IN/ A 


24 


14 


Normal 


N/A 


26 


30174 


Normal 


N/A 


27 


30105 


Normal 


N/A 


28 


30072 


Normal 


N/A 


29 


30026 


Normal 


N/A 


30 


30146 


Normal 


N/A 


31 


30250 


Normal 


N/A 


32 


30C71 


Normal 


N/A 


33 


15 


AcuteHAV 


N/A 


34 


16 


AcuteHAV 


VT / A 

N/A 


35 


17 


AcuteHAV 


N/A 


36 


18 


AcuteHAV 


N/A 


37 


48088 


AcuteHAV 


N/A 


38 


47288 


AcuteHAV 


N/A 


39 


47050 


AcuteHAV 


N/A 


40 


46997 


AcuteHAV 


N/A 


41 


19 


Convalescent HBV 


N/A 


42 


20 


(anti-HBSag+ve; 


N/A 


43 


21 


anti-HBCag+ve) 


N/A 


44 


22 


(anti-HBSag+ve; 


N/A 


45 


23 


anti-HBCag+ve) 


N/A 


46 


24 


(anti-HBSag+ve; 


N/A 


47 


25 


anti-HBCag+ve) 


N/A 


48 


26 


(anti-HBSag+ve; 


N/A 


49 


27 


anti-HBSag+ve) 


N/A 



^■Sequential 



serum samples were assayed from these patients 
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FIG. 34A 




FIG. 35 



SphI 12350, 



Clal*1 

Hind 3 7 
PstI 212 
f NcoI439 



Ncol 11806 



Xbal 
10727. 



Hind 3 
10413 



Pvul 
9675-M 



Hind 3 
9099 

Ecorl 
9023 

Ncol 8484 

Ecorl 8260 

KpnI78TO 
Clal 7774 

Pvul 7483 



Smal 1113 

^Hind3ll73 
Bamhl 1519 
,Sphl 1706 
Sail 1795 




PstI 4753 
Pvul 4879 



Hpal 7248 



Ecorl 5505 
Clal 5530 
Ecorl 5565 

Hind3 5669 
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SOD COOM l —adaptor™- ] I NANBHpo 1 ypept ide> 

AlaCysGlyVallleGlylleAlaGlnAsnLeuGlylleArgAspAlaHisPheLeuSer 
1 GCTTGTGGTGTAATTGGGATCGCCCAGAATTTGGGAATTCGGGATGCCCACTTTCTATCC 
CGAACACCACATTAACCCTAGCGGGTCTTAAACCCTTAAGCCCTACGGGTGAAAGATAGG 

»>»»»»»»»>»> 

G 1 nTh rLysG 1 nSe rGlyGl uAsnLeuP roTy r LeuVa lAlaTyrGlnAl aTh rVa 1 Cys 
61 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 
GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeu 
121 GCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

LysProThrLeuH isGlyP roThrProLeuLeuTy r A rgLeuG lyAlaValGl nAsnG 1 u 
181 AAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAA 
TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTT 

IleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAlaAspLeuGlu 
211 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAG 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

Va 1 Va IThrSe rThrTrpVa 1 LeuVa 1 G 1 yG 1 yVa lLeuAl aA 1 aLeuA 1 aA 1 aTy rCys 
301 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGGGCATAACG 

LeuSerThrGlyCysValVallleValGlyArgValValLeuSerGlyLysProAlalle 
361 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATC 
GACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAG 

1 1 eProAspArgG 1 uVa 1 LeuTy r ArgG 1 uPheAspG 1 uMetG 1 uG 1 uCysSe rGlnHis 
421 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCAC 
TATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTG 

LeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGly 
481 TTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGC 
AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

LeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrp 
5*»1 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGG 
GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACC 

GlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyr 
601 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATAC 
GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 

LeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThr 
661 TTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACA 
AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

AlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGly 
721 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGG 
CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCC 

TrpVa lAlaAlaGl nLeuA I aA l aProG lyAlaAl aTh r A 1 aPheVa lGlyAlaGlyLeu 
781 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 



FIG. 36B 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
841 GCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGG 
CGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCC 

Ty rG lyAlaGlyValAlaGlyAl aLeuVa 1 A 1 aPheLys 1 leMetSe rGlyGl uVa 1 Pro 
901 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCC 
ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

Se rTh rG 1 uAspLeuVa 1 AsnLeuLeuProA 1 a II eLeuSerProG 1 yA 1 aLeuVa 1 Va 1 
961 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

GlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAlaValGln 
1021 GGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAG 
CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTC 

««<«<««<«««<NANBH] t— extra 
TrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProValHisHis 
1081 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCAGTCCATCAT 
ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGTCAGGTAGTA 

] 

LysArcjOP 

1141 AAGCGTTGACGCTCCCTACGGGTGGACTGTGGAGAGACAGGGCACTGCTAAGGCCCAAAT 
TTCGCAACTGCGAGGGATGCCCACCTGACACCTCTCTGTCCCGTGACGATTCCGGGTTTA 

1201 CTCAGCCATGCATCGAGGGGTACAATCCGTATGGCCAACAACTAGCGCGTACGTAAAGTC 
GAGTCGGTACGTAGCTCCCCATGTTAGGCATACCGGTTGTTGATCGCGCATGCATTTCAG 

1261 TCCTTTCTCGATGGTCCATACCTTAGATGCGTTAGCATTAATCCGAATTC 
AGGAAAGAGCTACCAGGTATGGAATCTACGCAATCGTAATTAGGCTTAAG 




FIG. 37A 

FIG. 37B 





FIG. 38 

1 2 3 4 




FIG. 40 

12 3 4 



FIG. 41A 

ABC 




FIG. 41 B 

ABC 



FIG. 42A 

10 20 30 40 50 

HCV EYVVLLFLLLADARVCSCLWMMLL I SQAEAALENLVI LNAASLAGTHGLVSFLVFFCFA 

MNWVD1 AVSFVTLITGNMSFRDLGRVMVMVGATMTDDIGM6VTYLALLAAFKVRPTFAA6LLLRKL 
130 140 150 160 170 180 

60 70 80 90 100 110 

HCV WYLKGKWVPGAVYTFYGMWPLLLLLLALPQRAYALDTEVAASCGGVVLVGLMALTLSPYY 

MNWVD1 TSKELMMTTIGIVLLSQSTIPETILELTDALALGMMVLKMVRKMEKYQLAVTIMAILCVP 
190 200 210 220 230 240 

120 130 140 150 160 170 

HCV KRY I SWCLWWLQYFLTRVEAQLHVW I PPLNVRGGRDAVI LLMCAVHPTLVFD I TKLLLAV 

MNWVD1 NAVILQNAWKVSCTiLAWSVSPLFLTSSQQKADWIPLALTiKGLN^ 

250 • 260 270 280 290 

180 190 200 210 220 230 

HCV FGPLWI LQASLLKVPYF-VRVQGLLRF-CALARKMIGGHYVQMVI I KLGALTGTYVYNHL 

MNWVD1 KKRSWPLNEAiMAVGMVSiLASSLLKNDIPMTGPLVAGGLLTVCYV^ 

300 310 320 330 340 350 

240 250 260 270 280 290 

HCV TPLRDWAHNGLRDLAVAVEPVVFSQMETKLITWGADTAACGDIINGLPVSARRGREILLG 

MNWVD1 ADVK-WEDQAEI SGSSPi LSif ISE-D6SMS i KNEEEEQTLt I L I RTGLLV I SG — LFP 
360 370 380 390 400 410 

300 310 320 330 340 350 

HCV PADGMVSKGWRLLAPITAYAQQTRGLLGCIITSLTGRDKNQVEGEVQIVSTAAQTFLATC 

MNWVD1 VS I Pi f AAAWYtWEVKKQRAGVLWDVPSPPPVGKAELEDGA YR I KQKG i LGYSQ } GAGVY 
420 430 440 450 460 470 

360 370 380 390 400 410 

HCV INGVCWTVYHGAGTRTIASPKGPVIQMYTNVDQDLV GWPAPQGSRSLTPCTCGSSD 

MNWVD1 KEGTFHtMWHVTRGAVLMHKGKRiEPSWADVKKD^ 

480 490 500 510 520 530 

420 430 440 450 460 470 

HCV LYLVTRHADVIPVRRRGDSRGSLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGV 

MNWVD1 PGKNPRAVQtKPGLFKTN--AGTiGA^ 

540 550 560 570 580 590 




FIG. 42B 

180 190 500 510 520 530 

HCV AKAVDFIPVENLETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKS— TKVPAAYAAQ 

MNWVD1 AYVSA i AQT EK--S I EDNPE I E66 1 FRK — RKLTI MD^PGAGKt^RYLPA i VRG A I KR 
600 610 620 630 610 

510 550 560 570 580 

HCV GYKVLVLNPS--VAATLGFGAYMSKAHGIDPNIRTGVRTITTGSPITYSTYGKFLADGGC 

MNWVD1 GLRtli LAPf RVVAAEMEEALRGLPI RYOTPAI RA EHTGREI VDLICHAtnML-SPV 
650 660 670 680 690 700 

590 600 610 620 630 610 

HCV SGGAYDI 1 1 CDECHSTDATS I LG I GTVLDQAETAGARLVVLATATPPGSVTVPHPN I EEV 

MNWVD1 RVPNYNL 1 1 MDEAHFf DPA S I A ARGY } sf RVE-MGEAAG i FMTATPPGSRD-^F^QSNAP 
710 720 730 710 750 760 

650 660 670 680 690 700 

HCV ALSTTGEI PFYGKA I PLEVI KGGRHt I FCHSKKKCDELAAKLVALGINAVAYYRGLDVSV 

MNWVD1 IMDEEREiPERSWSSGHEWVTDFKGKTVWFVPSIKAGNDTAACLRKNGKKVTQLSRKTFD 
770 780 790 « n 0 810 820 

710 720 730 710 750 760 

HCV IPTSGDVVVVATDALMTGYTGDFDSVIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRT 

MNWVD1 SEYVKTRTNDWNFVVTTD I SEMG ANFKAERV I DPRRCMKPV I LTDGEERV I L AGPMPVTH 
830 810 850 860 870 880 

770 780 790 800 810 820 

HCV QRRGRTGRGKPGIYRFVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNT 

MNWVD1 SS 




FIG. 43 
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FIG. 45 

Name Common Sequence Variable Sequence 



5 ' - 3 - 1 AAGCTTGATCGAATTC CGATCTTGC 

-2 CGATCCTGC 

-3 CGATCATGC 

-4 CGATCGTGC 

-5 CGAAGTTGC 

-6 CGAAGCTGC 

-7 AGATCTTGC 

-8 AGATCCTGC 

-9 AGATCATGC 

-10 AGATCGTGC 

-11 AGAAGTTGC 

-12 AGAAGCTGC 

-13 CGATCTTGT 

-14 CGATCCTGT 

-15 CGATCATGT 

-16 CGATCGTGT 

-17 CGAAGTTGT 

-18 CGAAGCTGT 

-19 AGATCTTGT 

-20 AGATCCTGT 

-21 AGATCATGT 

-22 AGATCGTGT 

-23 AGAAGTTGT 

-24 AGAAGCTGT 

-25 CGCTCTTGC 

-26 CGCTCCTGC 

-27 CGCTCATGC 

-28 CGCTCGTGC 

-29 CGCAGTTGC 

-30 CGCAGCTGC 

-31 CGCTCTTGT 

-32 CGCTCCTGT 

-33 CGCTCATGT 

-34 CGCTCGTGT 

-35 CGCAGTTGT 

-36 CGCAGCTGT 
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FIG. 47 A 

61yCysProGluArgLeuAlaSerCysArgPrbLeuThrAspPlfeAspGlnGlyTrpGly 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysG lylleVal ProAl aLysSerVa 1 CysG 1 yProVa HyrCysPheThr 
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCA 
GGG6TTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT 

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
211 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

I leGlyGlyAlaGl yAsnAsnThrLeuH i sCysProThrAspCysPheA rgLysH i sPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgLeuTrpHisTyrProCysThrl leAsnTyrThrl lePheLysI leArg 
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 

MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
511 GGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThr 
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 

G 1 nTrpG 1 nVa 1 LeuProCysSerPheThrThrLeuProA laLeuSerThrG lyLeu I le 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

HisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

SerTrpA 1 a I leLysTrpG 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA laAspA 1 aArg 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCG 

Va 1 CysSerCysLeuTrpMetMetLeuLeuI leSerGlnAl aGluAl aA laLeuG 1 uAsn 
811 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCT 

LeuVa 1 1 leLeuAsnA laA 1 aSerLeuA 1 aG lyThrHi sG lyLeuVa 1 SerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 



FIG.47B 

PhePheCysPheAlaTrDTyrLeuLysGlyLysTrpValProGlyAlaValTyrihrPhe 
961 TGTTCTTCT6CTTTGCAT6GTATTT6AAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 

TyrGlyMetTrpProLeuLeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeu 
1021 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGC6TTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 

AspThrGluValAlaAlaSerCvsGlyGlyValValLeuValGlyLeuMetAlaLeuThr 
1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAAGTACCGCGACT 

LeuSerProTyrTyrLysArgTyr I leSerTrpCysLeuTrpTrpLeuGlnTyrPheLeu 
1111 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTC 
GAGACAGTG6TATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 

ThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsnValArgGlyGlyArg 
1201 TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC 
ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 

, ^ , ~ $sp a laVall leLeuLeuMetCysA 1 aVa 1 H i sProThrLeuVa 1 PheAspI 1 eThrLys 
1261 GCGACGCCGTCATCTTACTCATGTGTGCTGTAGACCCGACTCTGGTATTTGACATCACCA 
CGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 

LeuLeuLeuA 1 aVa 1 PheG 1 yProLeuTrp I leLeuG 1 nA 1 aSe rLeuLeuLysVa 1 Pro 
1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTAC 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG 

TyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAlaArgLysMetlleGly 
1381 CCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCG 
GGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGC 

GlyHlsTyrValGlnMetValllelleLysLeuGlyAlaLeuThrGlyThrTyrValTyr 
1M1 GAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTT 
CTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAA 

, ' AsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArgAspLeuAlaValAla 
1501 ATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGG 
TATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC 

ValGluProvalValPheSerGlnMetGluThrLysLeuIleThrTrpGlyAlaMspThr 
1561 CTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATA 
GACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTAT 

AlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArgArgGlyArgGluIle 
1621 CCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGA 
GGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAG6CGGGCGTCCCCGGCCCTCT 

LeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThr 
1681 TACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCA 
ATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGT 

AlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysMelleThrSerLeuThrGlyArg 
1711 CGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCC 
GCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGG 

AspLysAsnGlnValGluGIyGluValGlnlleValSerThrAlaAlaGlnThrPheLeu 
1801 GGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCC 
CCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGG 

,„„ AlaThrCysIleAsnGlyValCysTrpThrValTyrHisGlyAlaGlyThrArgThrlle 
1861 TGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAG5ACCA 
ACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGT 



1921 



A laSerProLysG 1 yProVa 1 1 leGl nMetTy rThr AsnVa 1 AspG InAspLeuVa 1G ly 
TCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGG 



FIG. 47C 

TrpProAlaProGlnGlySerArgSerLeuThrProCysThrCysGlySer.SerAspLeu 
1981 GCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTTGeGGCTCCTCGGACC 
CGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGG 

TyrLeuValThrArgHisAlaAspVallleProVaiArgArgArgGlyAspSerArgGly 
2011 TTTACCTGGTGACGAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGG 
AAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC 

SerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeu 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGT 
CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyliePheArgAlaAlaValCysThrArgGlyValAla 
2161 TGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGG 
ACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACC 

LysAlaValAspPhelleProValGluAsnLeuGluThrThrMetArgSerProValPhe 
2221 CTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGT 
GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

ThrAspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAla 
2281 TCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATG 
AGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC 

ProTh rG 1 ySe rG 1 y LysSe rTh r LysVa 1 ProA 1 a A 1 aTy r A 1 aA 1 aG 1 nG 1 yTy rLy s 
2311 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATA 
GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT 

Va 1 LeuVa 1 LeuAsnProSe rVa lAlaAl aThrLeuG 1 yPheG 1 yA 1 aTy rMetSe rLys 
2101 AGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCA 
TCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGT 

AlaHisGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerPro 
2161 AGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCC 
TCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGG 

.IleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
2521 CCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTT 
GGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAA 

AspIlellelleCysAspGluCysHisSerTh-rAspAlaThrSerlleLeuGlylleGly 
2581 ATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCG 
TACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGC 

ThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThr 
2611 GCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCA 
CGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGT 

ProProGlySerValThrValProHlsProAsnlleGluGluValAlaLeuSerThrThr 
2701 CCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCA 
GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGT 

GlyGluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgm 
2761 CCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGAC 
GGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTG 

LeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeu 
2821 ATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCAT 
TAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTA 

GlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGly 
2881 TGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCG 
ACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGC 

AspValValValValAlaTlirAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSer 
2911 GCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACT 
CGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGA 



FIG. 47D 

Val I leAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPhe 
3001 CGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
6CCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGA 

ThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg 
3061 TCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCA 
AGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGT 

ThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
3121 GGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerVa 1 LeuCysG 1 uCysTy rAspA laG lyCysA 1 aTrpTy rG 1 uLeu 
3181 GCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGC 
CGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
3241 TCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG 
AGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGC 

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
3301 TGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATG 
ACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln 
3361 CCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 
GGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGG 

A laThrVal CysA laA rgA 1 aG 1 nA laProProProSe rTrpAspG 1 nMetTrpLysCys 
3421 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 
TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCG 
CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGC 

ValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSer 
3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGT 
GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACA 

AlaAspLeuGluValValThrSerThrTrpValLfeuValGlyGlyValLeuAlaAlaLeu 
3601 CGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTT 
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAA 

AlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeuSerGly 
3661 TGGCC6CGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCG 
ACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGC 

LysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGlu 
3721 GGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAG 
CCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
3781 AGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGC 
TCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCG 

Lys A 1 aLeuG 1 yLeuLeuG 1 nThr A 1 aSe r ArgG 1 nA 1 aG 1 uVa 1 1 1 eA 1 aProA 1 aVa 1 
3841 AGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTG 
TCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGAC 

GInThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSer 
3901 TCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCA 
AGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGT 

GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
3961 GTGGGATACAATACTTGGGGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCAT 
CACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTA 



FIG. 47E 

MetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
4021 TGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCA 
ACTACCGAAAATGTCGACGACAGTGGTCGGGTGAnGGTGATCGGTTTGGGAGGAGAAGT 

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
1081 ACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTG 
TGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAAG 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
4141 TGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAG 
ACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATC 

IleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer 
1201 ACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGA 
TGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACT 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGly 
4261 GCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCG 
CGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
4321 GAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGC6 
CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGC 

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
4381 AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTT 
TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAA 

ProThrHisTyrValProGluSerAspAlaAlaAlaArgValThrAlalleLeuSerSer 
4441 CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA 
GGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGT 

LeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThr 
4501 GCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCA 
CGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGT 

ProCysSerGlvSerTrpLeuArgAspI leTrpAspTrpI leCysGluValLeuSerAsp 
4561 CTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGW 
GAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGC 

PheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlylleProPheValSer 
4621 ACTnAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTG^ 
TGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACA 

CysG InArgGI yTy rLysG 1 y Va 1 TrpA rgVa 1 AspG 1 y 1 1 eMetH i sTh r A r gCysH i s 
4681 CCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTC 

GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGG 

Cy sG 1 yA 1 aG 1 u 1 1 eTh rG 1 yH i s Va 1 Ly s AsnG 1 yTh rfletA rg 1 1 eVa 1 G 1 y P ro Arg 
4741 ACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGK 
TGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGAT 

ThrCysArgAsnMetTrpSerGlyThrPheProI leAsnA laTyrThrThrGlyProCys 
4801 GGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCT . 
CCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGA 

ThrProLeuProA 1 aProAsnTy rThrPheAl aLeuTr pA rgVa 1 Se r A 1 aG 1 uG 1 uTy r 
4861 GTACCCCCCnCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAAT 
CATGGGG6GAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA 

ValGluIleArgGlnValGlyAspPheHlsTyrValThrGlyMetThrThrAspAs^ 
4921 ATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATC 
TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 

LysCysProCysG 1 nVa 1 ProSe rProG 1 uPhePheThrG 1 uLeuAspG 1 yVa 1 A rgLeu 
4981 TCAAATGCCCGTGCGAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGG^ 

AGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGG 
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H i sArgPheA 1 aProProCysLysProLeuLeuArgG 1 uG 1 uVa 1 SerPheArgVa lGly 
5041 TACATAGGTTTGCGCCCCCCTb^ 
AT6TATCCAAACGK 

LeuHisGluTyrProValGlySerGlnLeuProCysGluProGluProAspVa^ 
5101 GACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGAC6TGGCCG 
CTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGC 

LeuTh rSe rMe tLeuTh r AspProSerH i s 1 1 eTh r A 1 aG 1 uA 1 aA 1 aG 1 yA rgA rgLeu 
5161 TGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGT 
ACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA 

■ AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSer 
5221 TGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCA^ 
ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA 

LeuLysA laThrCysThrAlaAsnH i sAspSerProAspA laG 1 uLeuI leG luA laAsn 
5281 CTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCA 
GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGT 

LeuLeuTrpA rgG 1 nG 1 uMetG 1 yG 1 yAsn 1 1 eThr A rgVa 1G 1 uSe rG 1 uAsnLysVa 1 
5341 ACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCAGCAGGGTTGAGTCAGAAAACAAAG 
TGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTC 

VallleLeuAspSerPheAspProLeuValAlaGluGluAspGluArgGluIleSerVal 
5401 TGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCG 
ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC 

ProA 1 aG 1 u I leLeuArgLysSer ArgArgPheA laGlnAl aLeuProVa lTrpA 1 aArg 
5461 TACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGC 
ATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG 

ProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyrGluProProVal 
5521 GGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTG 
CCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGAC 

ValHisGlyCysProLeuProProProLysSerProProValProProProArgLysL^^ 
5581 TGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGA 
ACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCT 

ArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAlaGluLeuAlaThrA^ 
5641 AGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAG^ 

TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGT 

SerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThrThrSerSer^ 
5701 GAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAA^ 

CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGAC 

ProA 1 aProSe rG lyCysProProAspSer AspA 1 aG 1 uSerTy rSerSerMe tProPro 
5761 AGCCCGCCCCnCTGG^GCCCCCCCGACTCCGACGCTGAGTa 

TCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGG 

LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTr^ 
5821 CCCTGGAGGGGGAGCCTGGGGATCCGGATCnAGCGACGGGTCATGGTC^ 

GGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCAT 

G 1 uA 1 aAsn A 1 aG 1 u AspVa 1 Va 1 CysCy sSe [MetSe rjy rSe rTr pTh rG 1 yA 1 aLeu 
5881 GTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTC^ 

CACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTG 

ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerU 
5941 TCGTCACCCCGTbCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGT 
AGCAGTGGGGCACGCGGCGCCTTCTT6TCTTTGACGGGTAGTTACGTGATTCGTTGAGCA 

LeuArgHisHisAsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
6001 TGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCC^ 

ACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCT 



FIG. 47G 

6061 AGA^AGTCACATTTGaEaG^ 

TCTTJCAGTGTAAACTGTCTGACGTTCAAGACCT.GTCG^^ 

cggtacggtctttccggcattgggtgtagttgaggcacM 
tacattgtggttatctgtgatggtagtaccgattcttgctc^ 

ggttcttttggggttaccccaagagcatactatmg^ 

6601 Ag|Eg1M^ 

tctcgctgtaggcatgcctcctccgttagatggttacaacactggagctgggggttw 

6661 GcffiicM 

CGCACCGGTAGnCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAAT^ 

6781 ^wmmmmmimt 
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FIG. 54A 

i mmmmmsmamat 

CAGGCCCAAGACCTTCTGCCGCACTTGATAC^ 
181 If Tjirf Tr rf Vrf yilakSJIteuSerCysLe uThrVa 1 ProA 1 aSe rA 1 aTy rG 1 nVa 1 

m smmmmmimmmm. 

GAGATGCACCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGnUcAM 



421 
481 
541 

601 GG 



FIG. 54B 

GACGACGATAAACGGCCGCAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCG^ 

o , i I!3ry§iSe rG lyPheVa 1 Ser LeuLeuA 1 aProG 1 yA 1 aLysG 1 nAsnVa 1 G 1 hLeu 1 1 e 
m ^T6IGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTa 
TGACACAGACCTAAACAATCGGAGGAGCGTGGTCCGCGGT^ 

TTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGM 

961A T M^ 

TGGCCGACCAACCGTCCCGAAAAGATAGTGGTG^ 

i M , ^fSLeuAlaSe rCys A rgProLeuTh r AspPheAspG ] nG 1 yTrpG 1 yProI 1 eSerTvr 
1021 AGGCTAGCCAGCTGCCGACCCCnACCGATTnGACCAGGGCTGGGK 
TCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC^ 

1081 GCCAA S cffi^ 

CGGnGGCnCGCCGGGGCTGGTCGCGGGGATGACGACCGTGA^ 

„., GlyUlValProAlaLysSerValCysGlyProValTyrCysPheThrProSerProVal 

CACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCCCACTTTTACTATGC 

1261 rTrrIrIIrrI^ITMPMT^^ G ^CCACCGCTGGGCAATTGGTTCGGTTGTACCTGG 
CTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGACC 

1521 mmimimwmmmMm® 

CCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAGGCCTGCGGTGTATG 

AGAGGCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATCC 
. mi ^M^Sf^^Mil^^^ 1 lePheLysI leArgMetTyrValGly 

GAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCCT 
. . GlvValGl uH i sArgLeuG 1 uA 1 aA 1 aCys AsnTrpTh rA rgG lyGluA rgCysAspLeu 

CTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGATGTGTCACCGTCCAG 



FIG. 54C 

1681 CTCCCGTGTTCCTTCACAACOT 

GAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGTC 

, 7M ^?P!i?y?JlAspyalGlnTyrLeuTyrGlyValGlySerSerI leAlaSerTrpAlal le 

17,1 miimmmmmmimimu 

TTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACG 
. oc . LeuTrpMetMetLeuLeu I leSerG 1 nA laG 1 uA 1 aA 1 aLeuG 1 uAshLeuVa UleLeu 

1921 MffgM^ 

1981 iffifflftS^^ 

CGTACCATAAACTTCCCATTCAC.CCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACC 
GGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCAC 
CGGCGCAGCACACCGCCACAACAAGA6CAGCCCAACTACCGCGACTGAGACAGTGGTATA 

ATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTG6TCTCACCTT 

AlgGlnLeuHisValTrpI 1 eProProLeuAsnVa 1 A rgG 1 yG 1 vA rgAsD A 1 aVa 1 1 le 
2221 GCGCAACTGCACGTGTGGATO 

CGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAG 

2281 IlT^MffiMK^^ 

AATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGG 

™, i rr)$$&ft oLeuJrp J 1 eLeuG 1 nA 1 aSerLeuLeuLysVa 1 ProTy rPheVa 1 A rg 
2m rIffl^^^ni^$IT^n^^^ GTTT6CTTAAA 6TACCCTkTTTGTGCGC 
CAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCG 

o/.m nGlVLeuLeuArgPheCysA 1 aLeuA LaArgLysMetl leGlyGlyHi sTyrVa 1 
2W1 GTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGC^ 

CAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCAC 

„ , „ , G 1 nMetVa 1 1 1 e 1 1 eLysLeuG 1 yA 1 aLeuThrG 1 yTh rTy rVa 1 Ty r AsnH i sLeuTh r 
2461 CMATGGTCATCAnMGTTA^ 

GTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGA 

Prol^ArgAs^rpAlaHlsAsn6^ 1 aVa 1 A 1 aValG 1 uProVa 1 

GGA6AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAG 

X? j| 5 1 UjhCLysLeuI 1 eThrTrpG lyA 1 aAsgTh rA 1 aA 1 aCysG 1 y 

CAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCA 

2611 GACATCATCAACGGCTTGCCTGH 

CTGTAGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGT 



FIG. 54D 

97n i ? tVa 1 SerLysG 1 yTrpArgLeuLeuA 1 aPro 1 1 eThrA 1 aTy rA laGln 

2701 

2761 MM 

GTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTn 

2821 Gfi§M^ 

cacctcccactccaggtctaacacagttgacgacgggtttggaaggaccgtS 

2881 ||M^G P M 

ttaccccacacgacctgacagatggtgccccggccttgc^ 
ccaggacagtaggtctacatatggttacatctggttctggaacacccgacw 

,„ ftl £]!^ySsrArgSerLe^ 

3001 caaggtagccgctcattgacaccctgc 

GTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGG^ 
7ni . . A rgH i sA 1 aAspVa 1 1 1 eProVa 1 A rgA rgA rgG 1 yAspSerA rgG 1 ySerLeuLeuSer 

GGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCC 
71 01 ^^§y§I51ynePheArgAlaAlaValCysThrArgGlyValAlaLysAlaValAsp 

AAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTA^ 

SgiProProValValProGlnSerPheGlnValAlaHisLeuHisAlaProThrGiv^pr 
3301 TCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCT 

AGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGG^ 

T66ATGCCGTTCAAGGAAC6GCTGCCGCCCACGAGCCCCCCGC6AATACTGTAnATTAA 

,,„, {!!?5§QGluCysHisSerThrAspAlaThrSerIleLeuGlyIleGlyThrValLeuAsD 
3601 JGJGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGK 

ACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAA^ 



FIG. 54E 

GlnAlaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThrProProGlySer 
3661 CAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCC 
GTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGG 

ValThrValProHisProAsnlleGluGluValAlaLeuSerThrThrGlyGluIlePro 
3721 GTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCT 
CAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGA 

PheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeuIlePheCys 
3781 TTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGT 
AAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACA 

HisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGlylleAsnAla 
3841 CATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAAT6CC 
GTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGG 

ValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAspValValVal 
3901 GTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTC 
CACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAG 

ValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSerVallleAspCys 
3961 GTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGC 
CACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACG 

AsnThrCysVa IThrGI nThrVa 1 AspPheSerLeuAspProThrPheThr I leG 1 uThr 
4021 AATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACA 
TTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGT 

IleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThrGlyArgGly 
4081 ATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGG 
TAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCC 

LysProGlylleTyrArgPheValAlaProGlyGluArgProSerGlyMetPheAspSer 
1141 AAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTCG 
TTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGC 

SerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThrProAlaGlu 
4201 TCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAG 
AGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTC 

ThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHis 
4261 ACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCAT 
TGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTA 

LeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAlaHisPheLeuSer 
4321 CTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCC 
GAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGG 

GlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGlnAlaThrValCys 
4381 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 
GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeu 
4441 GCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

LysProThrLeuHi sG 1 yProThrProLeuLeuTy rArgLeuG lyA laValGl nAsnG 1 u 
4501 AAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAA 
TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTT 

IleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAlaAspLeuGlu 
4561 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACAT6CATGTCGGCCGACCTGGAG 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

Va 1 Va 1 Thr SerThrTrpVa 1 LeuVa 1 Gl yG 1 yVa ] LeuA 1 aA 1 aLeuA 1 aA 1 aTy rCys 
4621 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACG 



FIG. 54F , , 

LeuSerThrGlyCysValVallleValGlyArgVaWa^ 
4681 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATC 
GACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAG 

IleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGIn 
4741 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCAC 
TATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTG 

LeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGly 
4801 TTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTC 
AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

LeuLeuGlnThrAlaSerArgGlnAlaGluVallleAIaProAlaValGlnThr^ 
4861 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGG 
GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACC 

GlnLysLeuGluThrPheTrpAlaLysHlsMetTrpAsnPhelleSerGlylleGlnT^^ 
4921 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGJGGAACJTCATC 

GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 

LeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThr 
4981 TTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTjm 
AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

AlaAlaVal Th rSe r ProLeuThrTh rSe rG 1 nTh r LeuLeuPheAsn 1 1 eLeuG 1 yG 1 y 
5041 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGG 
CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCC 

Tr pVa 1 A laA laG 1 nLeuA 1 aA 1 aP roG 1 yA la A 1 aTh r A 1 aPheVa 1 G 1 yA laG 1 yLeu 
5101 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 

AlaGlyAlaAlal leGlySerValGlyLeuGlyLysValLeuneAspHeLeuAlaGly 
5161 GCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGG 
CGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCC 

Ty rG lyA 1 aG 1 yVa 1 A 1 aG 1 yA 1 aLeuVa 1 A 1 aPheLys 1 1 eMetSe rG 1 yG 1 uVa 1 P ro 
5221 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCC 
ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

SerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValVa^ 
5281 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

G 1 y Va 1 Va 1 Cys A 1 a A 1 a 1 1 eLeuA rg ARgH i s Va 1 G 1 yProG 1 yG 1 uG 1 yAlaValG 1 n 
5341 GGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGG^ 

CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTC 

TrpMe t Asn A r gLeu 1 1 eA 1 aPhe A 1 aSe r A rgG 1 yAsnH i sVal Se rProTh rH i sTy r 
5401 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTJ^ 

ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATG 

Va 1 P roG 1 uSer AspA 1 a A 1 aA 1 a A rg Va 1 Th r A 1 a 1 1 eLeuSe rSe r LeuTh rVa 1 Th r 
5461 GTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGC^ 

CACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGG 

G InLeuLeuA rgA rgLeuH i sG lnTrp I leSerSerG 1 uCysTh rTh rProCysSerG ly 
5521 CAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATG^ 

GTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCA 

SerTrpLeuArgAspI leTrpAspTrpI leCysGluValLeuSerAspPheLysThrTrp 
5581 TCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACJTTAAGA^ 

AGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACC 

LeuLysAlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGly 
5641 CTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCG^ 
GATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCC 



FIG. 54G 

5701 




6121 



FIG. 54H 

6721 ICC^M 

AGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGnS 

6781 if& P cTO 

CCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGG^^ 

68.1 

GGA-CCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCA^ 

7021 flffipRfM^ 

TTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGT^ 

7no , AspARgLeuG 1 nVal LeuAspSerH i sTy rG InAspVa 1 LeuLysG 1 uVa 1 LvsA 1 aA 1 a 

7081 

CGCAGTTTTCACT l CCGATTGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGGT 



7fi?i IPrri rr lrr^?U?T3ff 51 ?SKf?5ff A§Rt euAs P pr oG lnA 1 aA rgVa 1 A la 1 1 eLys 



FIG. 541 

GlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCysGlyAsnThrLeuThr 
7741 GGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCACT 
CCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGA 

CysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeu 
7801 TGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTC 
ACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAG 

Va 1 CysG 1 yAspAspLeuVa 1 Va 1 1 1 eCy sG 1 uSer A laG lyValGlnGl uAspA laAla 
7861 GTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCG 
CACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGC 

SerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProProGlyAspProPro 
7921 AGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCA 
TCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGT 

GlnProGluTyrAspLeuGluLeuIleThrSerCysSerSerAsnValSerValAlaHis 
7981 CAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCAC 
GTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGTG 

AspGlyAlaGlyLysArgValTyrTyrleuThrArgAspProThrThrProLeuAlaArg 
8041 GACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGA 
CTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCT 

AlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuGlyAsnl lei leMet 
8101 GCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATG 
CGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTAC 

PheAlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePheSerValLeuIle 
8161 TTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATA 
AAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATAT 

AlaArgAspGlnLeuGluGlnAlaLeuAspCysGluI leTyrGlyAlaCysTyrSerl le 
8221 GCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATA 
CGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCeCCGGACGATGAGGTAT 

GluProLeuAspLeuProProI lei leGlnArgLeu 
8281 GAACCACTTGATCTACCTCCAATCATTCAAAGACTC 
CTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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FIG. 62A 

CACTCCACCATGAATCACTCCCCTGT6A6GAACTACT6TCTTCACGCA6AAAGCGTCTA6 
CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GT6GTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG-300 

— (Putative initiator methionine codon) 
GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC-600 
GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCGCCGGCG 
TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 
CTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT-900 
GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 
CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 
CGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 
CGC.GACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 
CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC-1200 
TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAAC 
GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACG6CGTTGGTAAT 
GGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGG 
AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGT 
GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA-1500 
CACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 
CAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 
CACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGA 
GAGGCTA6CCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 
TGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTG-1800 
CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 
GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATAC 
GGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTG 
GATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGC 
GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA-2100 
CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAG 
GCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGG 
AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCT 
GGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGT 
CCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA-2400 
GAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCAT 
TAACTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 
CTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACT 
TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTT 
TGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTG-2700 
GCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 
GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATA 
TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 
AGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 
CTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGC-3000 
CGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCG 
CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 
GCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCAC 
TCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGT 
CGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACGGCCGCGTGCGG-3300 
TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCC 
AGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCA 
GCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCA 
AGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT 



FIG. 62B 

CAATGG66T6TGCT6GACTGTCTACCACGGGGCCG6AACGAGGACCATCGCGTCACCCAA-3600 
GGGTCCTGTCATCCAGATGTATACCAAT6TA6ACCAAGACCTTGTGGGCTGGCCC6CTCC 
GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCA^ 
GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGC^ 

GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG ^ nnn 

GCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGG^ 

CnTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGA^^ 

CTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACA^ 

CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 

CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTA^ 

CGATCCTAACATCAGGACCGGGGTGAGAACAAHACCACTO^ 

CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 

TTGTGACGAGTGCCACTCCAGGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 

CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCg 

CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 

nnTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTa 

TCAnCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCAKAA^ 

CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 

CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACnCGACTCGGTGATAGACTG 

CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCgACgTCACCAnGAGAC 

AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG-4800 

GAAGCCAGGCATCTACAGAnTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 

GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGgCACGCCCGCCGA 

gactacagttaggctacgagcgtacatgaacaccccggggcttcccgtgtgccaggacca 
tcttgaattttgggagggcgtctttacaggcctcactcatatagatgcccacttt™ 

CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCA^ 

cgctagggctcaagcccctcccccatcgtgggaccagatgtggaagtgtttgattcgcct 

CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAG^ 

aatcaccctgacgcacccagtcaccaaatacatcatgacatgcatgtcggccg^ 
ggtcgtcacgagcacctgggtgctcgttggcggcgtcctggctgctttggccgcgtattg 

CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT-5400 
CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 
CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAgTCAAGCAGAAGGCa 
CCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 

AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCgjGCAGG 

GTATGGCGCGGGCGTGGCGGGAGCTCnGTGGCAnCAAGATCATGAGM cnnn 

CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTC6CCCGGAGCCCTM 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCA^ 

GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATG^ 

CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 

CCAGCTCCTGAG6CGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCMTGCTCCGG 

TTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTG-6300 
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FIG. 62D 

1 CACTCCACCATGAATCACTCCCCT6TGAGGA/\CTAGTGTCTTCACGCAGAAAGCGTCTA6 
GTGAGGTGGTACTTAGTGAGGGGACACTCCTTGATGACAGAAGTGCGTCTTTCGCAGATC 

61 CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCTCTCGGTAT 

121 GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
CACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCT 

181 TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
AGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCGGCTCATCA 

241 GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG 
CAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCCTC 

301 GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CAGAGCATCTGGCACGTGGTACTCGTGCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATT 

361 CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
GTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCA 

421 TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
AATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCT 

181 GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
CGCCAGCGTTGGAGCTCCATCTGCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCGTC 

5m GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC 
CTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCG 

601 GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 
CCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGC 

661 TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
ATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTA 

721 GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CCCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACC 

781 CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 
GCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAA 

Ml CTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT 
GAGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCA 

901 GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 
CGCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACAT 

961 CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 
GCTCCGCCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTT 

1021 CGCCTCGAGGTGTTGGGTG6CGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 
GCGGAGCTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGG 

1081 CGCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGA6CGCCACCCTCT6TTCGGC 
GCGCTGCGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCG 

1141 CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC 
GGAGATGCACCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAG 

1201 TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAAC 
AGGGTCCGCGGTGACCTGCTGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTG 

1261 GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAAT 
CCCAGTGGCGTACCGTACCCTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTA 
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FIG. 62G 

3961 CTCTCCACCAGTA6T6CCCCAGAGCTTCCAG6T6GCTCACCTCCATGCTCCCACAGGCAG 
GAGAGGT6GTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTC 

4021 CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 
GCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGA 

4081 CAACCCCTCTGTTGCTGCAACAGTGG6CTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 
GTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTA 

4141 CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC 
GCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAG 

4201 CACCTACGGCAAGnCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 
GTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTA 

4261 TTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCn 
AACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACT 

1321 CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 
GGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAG 

1381 C6TCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 
GCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGG 

4441 TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG 
AAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGAC 

4501 TCAnCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 
AGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACG 

4561 CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCG^ 
GCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCA 

4621 CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACT 
GCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGAC 

4681 CAATACGTGTGTCACCCAGACAGTCGATTTCAGC^ 

GTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTG 

4741 AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAM 
TTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCC 

4801 GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGn 

CTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAG 

4861 GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGC^ 

CAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCT 

4921 GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGAa 
CTGAT6TCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGT 

4981 TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCA^ 

AGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAG 

5041 CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCQ 

GGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACAC 

5101 CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCG^ 
GCGATCCCGAGTTC66GGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGA 

5161 CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAG^ 

GTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACT 

5221 AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCAT^ 

TTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCT 



FIG. 62H 

5281 ssgEW&a^^ 

CTTACCGtACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 
GAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCC 

CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTO 
GAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATG 

AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAAC^ 
TCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCC 

GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGG^ 
CACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAA 

AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCajGCAGG 
TCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCC 

GTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC 
CATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGG 

CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGT 
GAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCA 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGG^ 
GCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACGGGGCCCGCTCCCCCGTCACGT 

GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGJTTC 
CACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGAT 

CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTC 
GCACGGCCTCTCGCTACGTCGACGGGCGCAGTGAC6GTATGAGTCGTCGGAGTGACATTG 

CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACT^ 
GGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCC 

TTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGA^ 
AAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGAC 

GCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCm 
CGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCC 

GTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCT^ 
CATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACT 

GATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCaAGGACgGCAGGAA 
CTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTT 

CATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCC 
GTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGG 

TGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAG 
ACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTC 
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FIG. 62J 

7921 G6CCGTAACCCACATCAACTCCGTGTG6AAAGACCTTCTGGAAGACAATGTAACACCAAT 
CCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTA 

7981 AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCG 
TCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGAGGCAAGTCGGACTCTTCCCCCCAGC 

8041 TAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGC 
ATTC6GTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCG 

8101 TTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCA 
AAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGT 

8161 ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCA^ 

TATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGG 

8221 AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCG 
TTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGC 

8281 TACCGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA 
ATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTT 

8341 GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTAGCAATTCAAGGGGGGAGAACTG 
CAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGAC 

8401 CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCAC 
GCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTG 

8461 TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 
AACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGA 

8521 CGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGC 
GCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCG 

8581 GAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGAC^ 

CTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGG 

8641 ACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCA 
TGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGT 

8701 CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 
GCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGG6GGGAGCGCTC 

8761 AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCAT 
TCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTA 

8821 GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTAT 
CAAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATA 

8881 AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCAT 
TCGGTCCCTGGTCGAACTTGTCCGGGA6CTAACGCTCTAGATGCCCCGGACGATGAGGTA 

8941 AGAACCACTTGATCTACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACT 
TCTTG6TGAACTAGATGGAGGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGA 

9001 CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGGCTCAGAAAACTTGGGGT 
GGTGTCAATGAGAGGTCCACTTTAATTATCCCACCGGCGTACGGAGTCTTTTGAACCCCA 

9061 ACCGCCCTTGCGAGCTTGGAGACACCG66CCCGGAGCGTCCGCGCTAGGCTTCTGGCCAG 
TGGCGGGAACGCTCGAACCTCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTC 

9121 AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 
TCCTCCGTCCCGACGGTATACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGA 
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TRANSFORM Ecoli WITH RECOMBINANT PLASMIDS 




| BSA ABSORBTION/DNAse/LYSOZYME 
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ANTIBODY 
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SOD 
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NT. = EXPRESSION NOT TESTED 
t THIS POLYPEPTIDE WAS NEGATIVE IN THIS 
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R T 

MSTNPKPQKKNKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATR 
KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 
RGSRPSWGPTDPRRRSRNLGKV I DTLTCGFADLMGY I PLVGAPLGGAARA 

T 

LAHGVRVLEDGVNYATGNLPGCSFSIFLLALLSCLTVPASAYQVRNSTGL-200 
YHVTNDCPNSSIVYEAADA1LHTPGCVPCVREGNASRCWVAMTPTVATRD 
GKLPATQLRRH I DLLVGS ATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-300 

V 

TQGCNCS IYPGHI TGH RMAWDMMMNWSPTTALVMAQLLR I PQA I LDM I AG 

AHWGVLAGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFV-400 

SLLAPGAKQNVQL I NTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS 

GCPERLASCRPLTDFDQGWGPISYANGSGPDQRPYCWHYPPKPCGIVPAK-500 

SVCGPVYCFTPSPVVVGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 

GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

PWLTPRCLVDYPYRLWHYPCTINYTIFKIRMYVGGVEHRLEAACNWTRGE 
RCDLEDRDRSELSPLLLTTTQWQVLPCSFTTLPALSTGLIHLHQNIVDVQ-700 
YLYGVGSS I ASWAI KWEYVVLLFLLLADARVCSCLWMMLL I SQAEAALEN _ 
LVILNAASLAGTHGLVSFLVFFCFAWYLKGKWVPGAVYTFYGMWPLLLLL-800 

(N) 

LALPQRAYALDTEVAASCGGVVLVGLMALTLSPYYKRY I SWCLWWLQYFL nnn 
TRVEAQLHVW I PPLNVRGGRDAV I LLMCAVHPTLVFD I TKLLLAVFGPLN-900 
ILQASLLKVPYFVRVQGLLRFCALARKMIGGHYVQMVI IKLGALTGTYVY 
NHLTPLRDWAHNGLRDLAVAVEFVVFSQMETKLITWGADTAACGDI INGL-1000 
PVSARRGREILLGPADGIWSKGWRLLAPITAYAQQTRGLLGCI ITSLTGR _ 
DKNQVEGEVQ I VSTAAQTTFATC I NGVCWTVYHG AGTRTI ASPKGPV I QM-1 100 
YTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHADVIPVRRRGDSRG 
SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 
LETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

VLVLNPSVA ATLGFGAYMSKAHG IDPNI RTGVRT I TTGSP I TYSTYGKFL- 1300 

ADGGCSGGAYD 1 1 1 CDECHSTDATS I LG I GTVLDQAETAGARLVVLATAT n § nn 
PPGSVTVPHPN I EEVALSTTGE I PFYGKA I PLEV I KGGRHL I FCHSKKKC-1400 
DELAAKLVALG I N AVAYYRGLDVSVI PTSGDVVVVATDALMTGYTGDFDS 

Y (S) 
VI DCNTCVTQTVDFSLDPTFT I ET I TLPQDA VSRTQRRGRTGRGKPG I YR- 1500 
FVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNTPGLPV nnn 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP-1600 
PPSWDQMWKCL I RLKPTLHGPTPLLYRLGAVQNE I TLTHPVTKY I MTCMS , _ 
ADLEVVTSTWVLVGGVLAALAAYGLSTGCVVIVGRWLSGKPAI IPDREV-1700 

LYREFDEMEECSQHLPY I EQGMMLAEQFKQKALGLLQTAS RQAEV IAPAV 

QTNWQKLETFWAKHMWNF I SG I QYLAGLSTLPGNPA I ASLMAFTA AVTSP- 1800 
LTTSQTLLFN I LGGWVAAQLA APGAATAFVGAGLAGAA I GSVGLGKVL I D 



FIG. 66B 



(G) 

ILAGYGAGVAGALVAFKIMSGEVPSTEDLVNLLPAILSPGALVVGVVCAA-1900 

(HO 

ILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAAARVTAILSS 
LTVTQLLRRLHQWISSECTTPCSGSWLRDIWDWICEVLSDFKTWLKAKLM-2000 

(V) 

PQLPG 1 PFVSCQRGYKGVWRGDG I MHTRCHCGAE I TGHVKNGTMR I VGPR 

TCRNMWSGTFPI NAYTTGPCTPLPAPNYTFALWRVSAEEYVEI RQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASQLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 

VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

S 

WKKPDYEPPVVHGCPLPPPKSPPVPPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSSSTSGITGDNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-2400 

SDGSWSTVSSEANAEDVVCCSMSYSWTGALVTPCAAEEQKLPINALSNSL 

LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDVLKEVKAAASKVKA-2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTH I NSVWKDLLEDN 

VTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVT-2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SD I RTEEA I YQCCDLDPQARVA I KSLTERLYVGGPLTNSRGENCGYRRCR-2700 

ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLVV1CESAGVQ 

ED AASLRAFTEAMTRYS APPGDPPQPEYDLEL I TSCSSNVSVAHDGAGKR-2800 

VYYLTRDPTTPLARAAWETARHTFVNSWLGNIIMFAPTLWARMILMTHFF 

SVL I ARDQLEQALDCEI Y6ACYS I EPLDLPP1 IQRLHGLSAFSLHSYSPG-2900 

G 

EINRVAACLRKLGVPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 
RTKLK — (Stop codon not yet reached) 



( ) = Heterogeneity due to possible 5' or 3' 
terminal cloning artefacts. 
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FIG. 68 



NS5 

Highly-conserved 
Polymerase 

NS3 region region 

Flavivi ruses TATPPG SAAQRRGRIGRNP — GDDCVV 

(Yellow Fever, 
West Nile, Dengue) 

«**••• » ••*•» *« «•« •» 

HCV TATPPG SRTQRRGRTGRGK GDDLVV 

i l l 

#1348 #1483 #2737 



FIG. 73 

5' 3' 
CCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAAC 
CGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTG 
3' 5' 

5' 3' 
CATGTTTCCCCCTAATGAG 
GTACAAAGGGGGATTACTCAGC 
3' 5' 
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1 GAATTCGGGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTC 
CTTAAGCCCAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAG 

1 ECORl, 7 NLA1V, 8 AVA2 SAU96, 15 FOK1, 24 NSPB11, 26 FNU4H 
1, 52 SFAN1, 57 MNL1, 60 NLAlll, 

TACTGGCCX3ATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTC 

65 HPA11, 74 HPA11, 83 TAQ1, 85 HINF1, 90 HPH, 106 AFL111 MA 
E2, 112 MAE3, 113 HPH, 

ThrValAspPheSerl^uAspProThrPheThrlleGluThrlleThrLeuProGlnAsp 
121 ACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACAAT^CGCTCCCCCAMAT 
TGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTTCTA 

125 TAQ1, 149 HPH, 178 SFANl, 

AlaValSerArgThrGlnArgArgGlyArgThxGlyArgGlyLysPr^lylle^rArg 
181 GCTGTCTCCCG^CTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGG^TCTACAGA 
CGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCT 



198 MAE2, 226 ECOR11 SCRF1, 230 SFAN1, 



241 




TTTGTCGCACCGGGGGAGCGCCCTCCGGCATGTTCCiAUT^lV^T^^i^xox^x^^^ 
AAACACCGTCGCCCCCTCGCGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGG 



246 BAN1 NLA1V, 250 HPA11 NCI1 SCRF1, 257 HAEll, 258 HHA1, 2 
62 MNL1, 265 HPA11, 268 NSPC1, 269 NLAlll, 274 TAQ1, 276 HIN 
Fl, 287 MNL1, 296 BSP1286, 

Arglle 
301 CGAATTC 
GCTTAAG 

302 EC0R1, 
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AlaValAspPhelleProValGlxiAsnLeuGluThrThrMetArgSerProValPheThr 
GCGGT6GACTTTATCCCTGTGGAGAACCTAGAGACAACCAT6AGGTCCCCGGTGTTCACG 
CGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGC 

A AAA AA 

29 MAE1, 40 NLA111, 43 MNL1, 45 AVA2 NLA1V SAU96 , 49 NCI1 SC 
RF1, 50 HPAll, 

AspAsnSerSexProProValValProGlnSerPheGlnValAlaHisLeuHisAlaPro 
GATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCC 
CTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGG 

A A A A AAA 

69 MNL1, 83 BSP1286, 92 ALUl, 97 EC0R11 SCRF1, 106 HPH, 109 
MNL1, 113 NLA111, 

ThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVal 
ACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTG 
TGTCC6TCGCC6TTTTC6TGGTTCCAGGGCCGAC6TATAC6TC6AGTCCC6ATATTCCAC 

AA A A AA A A AAA 

126 BBV FNU4H1, 127 NSPB11, 129 FNU4H1, 145 AVA2 NLA1V SAU96 
, 148 NCI1 SCRF1, 149 HP All, 152 BBV FNU4H1, 156 NDEl, 161 B 
BV FNU4H1/ 163 ALUl, 165 DDE1, 

LeuVall^uAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAla 
CTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCT 
GAT^TGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATCTACAGGTTCCGA 

182 MAE1, 184 SCA1, 185 RSA1, 195 MNL1, 203 BBV FNU4H1, 228 
AFL111 NSPC1, 229 NLA111, 

HisGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerProIl 

CATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACXIACTCGCAGCCCCATC 

GTACGCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACOGTCGGGGTAG 

A AAAA A AAA 

242 NLA111, 246 BIN1, 247 MB01 SAU3A, 248 CLA1, 249 TAQ1, 25 
1 BIN1 MB01 SAU3A/ 264 AVA2 SAU96, 267 HPA11 NCI1 SCRF1, 271 
HPH, 291 BBV FNU4H1, 

ThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAsp 
ACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGAC 
TGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTG 



302 MAE2, 304 RSAl, 340 BSP1286 HGIA, 
51 HHA1, 



343 AVA1, 350 HAE11, 3 



IlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThr 
ATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATTGGCACT 
TATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACCGTGA 

A A A A 

372 MAE3, 391 F0K1, 392 SFAN1, 399 F0K1, 

ValLeuAsptSlnAlaGluThrAlaGlyAlaArgLeuValValLeiiAlaThrAlaThrPro 
GTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCT 
CAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGA 

A A 

431 TTHIII2, 435 ALWN1, 461 BSP1286 HGIA, 479 MNL1, 




FIG.79B 

PrcsGiySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThrGly 
482 CCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGA 
GGCCCGAGG^GTGACACGGGGTAGGGTTG^ 

482 HPA11 NCI1 SCRFl, 484 BANll BSP1286, 485 NLA1V, 491 MAE3 
, 497 BSP1286, 503 F0K1, 513 TAQ1, 515 MNL1, 518 MNL1, 537 H 
PA11, 

GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
542 GAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTC 
CTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAG 

543 XH02, 544 BIN1 MBOl SAU3A, 571 MNL1, 573 TAQ1, 

IlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGly 
602 ATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGC 
TAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCG- 

603 MBOll, 619 MBOll, 638 FNU4H1, 645 ALUl, 660 SFANl, 

IleAsnMaValMaTyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAsp 
662 ATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGAT 
TAGTTACGGCACCGGATGAT^CGCCAGAACTCCACAGGCAGTAGGGCTGGTCGCCGCTA 

672 HAE1, 673 HAE111, 682 NSPB11 SAC2, 683 THAI, 693 AFL111 
MAE2/ 703 FOK1, 712 NSPB11, 714 FNU4H1, 

ValValValVamaThrAspAlalieimetThrGlyTyrThrGlyAspPheAspSerVal 
722 GTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTG 
GAACAGCAGCACCG TTGGCTACGGG AGTACTGGCCG ATATGGCCGC TGAAGCTG AGCCAC 

740 SFANl, 745 MNL1, 748 NLA111, 753 HPA11, 762 HPA11, 771 T 
AQ1, 773 HINF1, 778 HPH, 

IleAspCysAsnThrCysValThrGlnThrValAspPheScrl^uAspProThrPheThr 
782 ATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACC 
TATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGG 



794 AFL111 MAE2, 800 MAE3, 801 HPH, 813 TAQ1, 837 HPH, 

IleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThr 
842 ATTGAGAGAATCACGCTCCCCCAAGATGCTGTCTCCCGCACTGAACGTCGGGGCAGGACT 
TAACTCTGTTAGTGCGAGGGGGTTCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGA 

866 SFANl, 886 MA£2, 

GlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGlyMet 
902 GGO\GGGGGAAGCCiU3GCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATG 
CCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGTOA^CCGTAC 

914 EC0R11 SCRFl, 918 SFANl, 934 BAN1 NLA1V, 938 KPA11 NCI1 
SCRFl, 945 HAE11, 946 HHA1, 948 BGL1, 951 MNL1, 954 HPA11, 9 
57 NSPC1, 958 MLA111, 

PheAspSerSerVall^uCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThr 
962 TTCGACTCGTCC^TCCTCTGTCAGTGCTATGACGCAGGCTGTGCTTC 

AAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGC 

963 TAQ1, 965 HINF1, 976 MNL1, 992 HGA1, 1003 TTHIII2, 1013 
BANll BSP1286 HGIA SAC1, 1014 ALU1, 




1082 



1142 



1202 



1262 



FIG. 790 

1051 RSA1, 1054 NLA111, 1063 AVA1 NCI 1 SCRF1 SMAl, 1064 HPA1 
1 NCI1 SCRF1, 1081 ECOR11 SCRF1, 

GlnAspHisI^uGluPheTrpGluGlyValPheThrGlyl^uThrHisIleAspAlaHis 
CAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCAC 
GTCCTGGTAGAACTTAAAACCCTCCCGCAGAMTGTCCGGAGTGAGTATATCTACGGGTG 

1084 AVA2 SAU96, 1103 MNL1, 1106 AHA11, 1107 HGA1, 1117 HAE1 
STU1, 1118 HAE111, 1120 MNL1, 1133 SFAN1, 

PheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGlnAla 
TTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCC 
AAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATMACCATCG^TGGTTCGG 

1183 ECOR11 SCRF1, 1192 RSA1, 1201 DRA3, 

ThrValC^sMaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeu 
ACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTG- 
TGG«CACGCGATCCC<^GTTCGG^AGGGGGTAGCACCCTGGTCT 

1209 HHA1, 1212 MAE1/ 1215 BAN11 BSP1286, 1226 MNL1, 1239 NL 
A1V, 1240 AVA2 SAU96, 1256 TTHIII2, 1261 HINF1, 

IleArgl^uLysProThrl^uHisGlyProThrProLeuLeuTyrArgLeuGlyAlaVal 
ATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTT 
TAAGCGGAGTTCGGGTG^A^TACCCOT^ 

1267 MNL1, 1279 MNL1, 1282 NCOl, 1283 NIA111, 1286 SAU96, 12 
87 HAE111, 1313 HAE11, 1314 HHA1, 

GlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAla 
CAGAATGAAATCACCCIX^CGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCC 
G TCTTACTTTAGTGGGACTGCG TGGGTCAG TGGTTTATG TAGTACTG TACGTACAGCCGG 

1332 HPH, 1339 HGA1, 1349 MAE3, 1350 HPH, 1363 NLA111, 1367 
NSPC1, 1368 NLA111, 1369 AVA3 NSI1, 1371 NSPC1, 1372 NLA111, 
1377 CFR1 XMA3/ 1378 HAE111, 

AspI^uGluValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeuAla 
GACCTGGAGGTCGTCACGAGCACCTCGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCC 
CT^ACCTCCMCAGTGCTCGTGGACCCACGAGCMCCGCg; 

1384 EC0R11 SCRF1, 1385 GSU1, 1388 MNL1, 1394 MAE3, 1399 BSP 
1286 HGIA/ 1404 ECORll SCRFl, 1409 BSP1286 HGIA, 1419 FNU4H1 
, 1421 AHA11, 1422 HGA1, 1426 ECORll SCRFl, 1430 BBV FMT4H1, 
1437 CFR1, 1438 HAE111, 1439 FNU4H1, 1441 THAI, 

AlaTyrCysI^uSerThrGlyCysValVallleValGlyArgValVaU^erGlyLys 
GCGTATTGCCTCTCAACAGGCTfcCGTGGTCATAGTGGGCAGGGTCGTCTTGTCC^ 
CGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTC 

1453 HINC11, 1461 BBV FNU4H1, 1494 HPAll NCI1 SCRFl, 1501 HA 
El, 

ProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCys 
1502 CCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTTC 
GGCCGTTAGTATGGACTGTCCCTTCAGGAGAIGGCTCTCAAGCTACTCTACC^ 

1502 HPAll, 1528 MNL1, 1542 TAQ1, 1553 MBOll, 1558 BSP1286 H 
GIA, 

SeiGlnHisI*uProTyrn«^luGl^ 
1562 TCTCAGCACOTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAJ^ 
AGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCT^ 

1S63 DDE1. 1576 RSAl, 1581 TAQ1, 1590 FOK1, 1594 SFANl, 1612 



1322 



1382 



1442 



TTHIII2, 1621 HAE111 SAU96, FIG; 79 D 

AlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGln 
1622 GCCCTCGGCCTCCTGGAGACCGCGTCGCGTCAGGCAGAGGTTATCGCCCCTGCT6TCCAG 
CG^AGCCTCAG^CGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTC 

1624 MNL1, 1628 HAE111, 1630 MNL1, 1634 PST1, 1639 TTHIII1, 
1642 THAI/ 1643 HGA1, 1658 MNL1, 

ThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGly 
1682 ACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGG 
TGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCC 

1697 AVA1 XHOl, 1698 TAQ1, 1718 NDE1, 

IleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMet 
1742 ATACAATACTTCGCGGGCTTGTCAACGCTGCCKWTAACCCCGCCATTGCTTCATTGATG 
TATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTAC- 

1762 HINC11, 1768 BBV FNU4H1, 1772 BCOR11 SCRF1, 1775 BSTE2, 
1776 MAE3/ 

AlaPheThrAlaMaValThrSerProI^uThrThrSerGlnThrLeuLeuPheAsnlle 
1802 GCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATA 
CGAAAATGTCGACGACMTGGTCGGGTGATTGGTUTCGGTTTG^ 

1809 ALWN1 HSPB11 FVU11, 1810 ALU1, 1811 BBV FNU4H1, 1817 MA 
E3, 1818 HPH, 1836 MAE1, 1846 MNL1, 1849 MNL1, 1851 MBOll, 

I^uGlyGlyTrpValMaAlaGlnLeuAlaAlaProGlyAlaAlaThrAiaPheValGly 
1862 TTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGC 
AACCCCCCttCCCACCGAO^TCGAG^ 

1877 BBV FNU4H1, 1884 ALU1, 1889 FNU4H1, 1895 NCI1 SCRF1, 18 
96 HPAll, 1898 BAN1 NLA1V, 1901 FNU4H1, 1919 HAE11, 1920 HHA 



AlaGlyLeuAlaGlyAlaAlalleGlyScrValGlyLeuGlyLysValLeuIleAspIle 
1922 GCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATC 
CGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGAC^ 

1927 DDE1, 1930 ALU1, 1934 AHA11 BAN1 HAE11 KAR1 NLA1V, 1935 
HHA1, 1937 FNU4H1, 1966 AVA2 SAU96, 1969 MNL1, 1978 FOK1, 

LeuAlaGlyTyrtlyAlaGl^alMaGlyAlaLe^ 
1982 CTTGCAGGGTATGGCGOGGGCGTGGOGGGAGCTCTTGTGGCATTCAAGATCATC 
GAACGTCCCAIACCGCGCCCGCACC^ 

1995 HHA1/ 1996 THAI, 2010 BAN11 BSP1286 HGIA SAC1, 2011 ALU 
1, 2021 BSM1, 2029 MBOl SAU3A, 2032 NLA111, 2039 HPH, 

GluValProSerThrtluAspI^uValAsnLeul^uProAlallel^uSerProGlyAla 
204 2 GAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCG^CATCCTCTCGCCCGGAGCC 
CTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTGGG 

2042 MNL1, 2044 AVA2 NLA1V SAU96, 2049 MNL1, 2057 MNL1, 2059 
AVA2 SAU96 , 2060 TTHIII1, 2062 EC0R11 SCRF1, 2083 FOK1, 208 
6 MNL1, 2093 NCI1 SCRF1, 2094 HPA11, 2096 NLA1V, 2097 BAN11 
BSP1286, 2101 MNL1, 

I^uValValGlyValVal^sAlaAlallel^uArgArgHisValGlyPrtDGlyGluGly 
2102 CTCGTAGTCGGCGTGGTCTGTGCAGCAATACTCCGCCGGCACGTTGGCCC^^ 
GAGCATCAGCCGGACCAGACACGTCGTTATGACGCGGCCGTC 

2123 BBV FNU4H1, 2134 HHAl, 2136 NAE1# 2137 HPAll, 2142 MAE2 
, 2147 HAE111 SAU96 , 214 9 AVA1 NCI1 SCRF1 SMA1, 2150 HPAll H 



FIG. 79E 



CIl SCRF1, 2156 MNL1, 

AlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
2162 GCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 

2172 F0K1, 2179 HP All, 2196 MNL1, 2199 AVA1 NCI1 SCRF1 SMAl, 
2200 HPA11 NCI1 SCRF1, 2205 NLA1V, 2210 NLA111, 
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i -•!• , 10 ; 20 30 40 

GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 

Y **»•*»••**•*»*•*****«***«*•****•**•** + •*•* 

A *•*•••••»••••••*•••••••••••»*•••••••*••••• 

/SSp CTCTCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 
550 560 570 580 590 600 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTAGT 

: : : ******•••••»•** 

AACGGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGT 
610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTG 

AATCGCTCAGCTCCTCCGGATCCCACAAGCCATCTTGGAC^ 

670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 



GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGT 
730 740 750 760 770 780 

230 240 250 260 270 280 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGG 
790 800 810 820 830 840 

290 300 310 320 330 340 

CAAAACTACGGCTAGCCTTACTGGTCTCTTCAATTTAGGTGCCAAGCAGAACATCCAGCT 

«• • • • • • ■ *• ••• *•*»••* 

* • • • • ♦ • ■ ••••«•* 

CCACACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCT 

850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTGCAATGATAGCCT 

*•*••***•***• ***•••**•*•» • * * * * * ****** ••**•**•**••*•*•*** 
«•**«***•*»•»**••••**•*•* •*•* ** ***•*• 

GATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGGCT 
910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 
::::::::::: :X 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG. 86 



AA #117-308 (putative envelope region) . FIG. 87 

1) HOT #18 (USA) 3 clones sequenced 

2) JH23 (USA) ? 

3) JH 27 (USA) . ? 

4) PBL-Th (USA) 2 clones sequenced 

5) EC1 (Italy) 3 clones sequenced 

6) HCV-1 (chimpanzee) multiple 

C/M<-r-*S 

1) <» 

2) 
3) 
4) 
5) 

6)RNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNL 

D H 

2) 

3) S T T 

4) L 

5) (F) S 

6) PGCSFSIFLLMJ_SCLTVPASAYQVRNSTGLYHVTNDCPNSSIVYEAADAILH 

1) (H) V V T 

2) A D V V K T 

3) S PVA N 

4) A ART 

5) H V T 

6) TPGCVPCVREGr^SRCWVAMTPTVATRDGKLPATQLRRHIDLLVGSATLCS 
1) 

2) I D 

3) D 
4) 

5) I 

6) ALYVGDLCGSvTT.VGQLFTFSPRRHWTTCX3CNCSI 
SUMMARY: "S" AA117-308 (93Z) 

HCT018, PBL-Th, EC1 (Italy) have 97% homology with HCV-1 

JH23 and JH 27 have 96% and 95% homology with HCV-1 .respectively 



AA#300-438 ( C-terminal region of the putative envelope region anH 
amino -1/3 of NSn 



1) JH23 

2) JH27 : 

3) Japanese isolate (T. Miyamura) 

4) EC10 (Italy) 



5) HCV-1 (chimpanzee) 



1) D 

2) D 

3) 
4) 

5)TTQGCNCSIYPGHITGHRMAWDMMMNWSPTTALVMAQLLRIPQAILDMIAGA 



■• ? - 
? 

? ' 

2 clones sequenced 
(one nt difference, which did not 
result in an amino acid change) 
multiple 
s < | > nsi 
a v 

A 

vs 



VM V 



1) 

2) 
3) 



M 



L Y 



I M 



R ARSTA VA 

T YT N AR TQALT F 
GH R VQ VT TLT 



4 ) A I A K TASLTA 

5) HWGVLAGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFVSL 



1 )FS R l 

2) FT D I 

3) FR S Kl 

4) FNL I 



R 
R 
R 



T 
D 
Q 



5)L^PGAKQNVQLINTNGSWHLNSTALNCNDSLNTGWL 



SUMMARY: NS 1 AA 330-660 

"Isolate" ^Homology (AA330-438) 

JH23 83 
JH27 80 
Japanese 73 
EC10 (Italy) 84 



^Homology CAA383-405) 

57 
39 
48 
48 
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5' terminus— — — ' 

CACTCCACCATGAATCACTCCCCTGTGAGGAACTACTGTCTT.CACGCAGAAAGCGTCTAG 
CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCGCCGCTCCGGGGAGAGCCATA 
GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGAGGACCGGGTCCTTTCTTGGA 
TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG-300 

(Putative initiator methionine codon) 

I G C 

GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC-600 
GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 
TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 

CTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT-900 

GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 

CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTK 

CGCCTCGAGGTGTTGGGTGGCGAT6ACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 

CGCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 

CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC-1200 

TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCGCGGCCATATAAC 

G 

GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAAT 
GGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGG 
AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGT 
GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA-1500 



CACTGTGTCTGGATTTGTTAGCC 
CAACACCAACGGCAGTTGGCACC 
CACCGGCTGGTTGGCAGGGCTTT 
GA6GCTAGCCAGCTGCCGACCCC 



fCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 
"CAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 
CTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGA 

rTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 

TGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTG-1800 
CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 
GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATAC 
GGACGTCTTCGTCCnAACAATACCAGGCCACCGCTGGGCAAnGGnCGGTTGTACg^ 
GATGAACTCAACTGGAnCACC AAAGTGTGCGGAGCGCCTCCnGTGTCATCGGAGGGGC nn 
GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA-2100 

C 

CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAG 
GCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATG^ 
AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCT 



GGAAGACAGGGACAGGTCCGAGC 
CCTCCC6TGTTCCTTCACAACCC 



CAGCCCGTTACTGCTGACCACTACACAGT6GCAGGT 
ACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA-2100 



GAACAnGTGGACGTGCAGTACnGTACGGGGTGGGGTCAAGCATCGCGTCCTGGG^ 
TAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 
CTTGTGGATGATGGTACTCATATCCCAAGCGGAGGCGG^ 

TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCn o , nn 

TGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGA6CGGTCTACACCTTCTACGGGATGTG-2700 

GCCTCTCCTCCTGCTCCT6TTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 

GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATA 

TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 

A6CGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 
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CTTACTCATGTGTGCTGTACACCCGACTCTGG^ 

CGTCTTCGGACCCCTTTGGATTCTTCAA6CCAGTTTGCTTAAAGTACCCTACTTTGTGCG 

CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 

GCAAATGGTCATCATTAAGTTAGGGGCGCnACTGGCACCTATGTTTATAACCATCTCAC 

TCCTCTTCGGGACTGGGCGCACAAGGGCTTGCGAGATCTGGCCGTGGeTGTAGAGCCAGT 

CGTCnCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCG^ 

TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATAgGCTCGGGCC 

AGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCA 

GCAGACAAGGGGCCTCCTAGGGTGCATAATGACCAGeCTAACTGGCCGGGACAAAAACCA 

AGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT ?cnn 

CAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAA-3600 

T 

GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 

GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACQTTACQGGTCAC 
GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTC 
GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG 
GCACGCCGTGGGCATATTT^ 

CmATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATM 
CTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCA^ 
CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTG^ 
CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 

CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCA^GGCAGCCCCATCA^ 

CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 

TTGTGACGAGTGCCACTCCACGGATGCCACATCCATgTGGGCATCGGCAC^ 

CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 

CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAG^ 

TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG-4500 

TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGaGGTCGCATT 

CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 

A 

CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCMCTTCM 
CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGAC 
AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGAgGGCAGGGG^ 
GAAGCCAGGCATCTACAGATTTGTGGCACCGGG^ 

GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGaCACGCCCGCCGA 
GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGA^ 
TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC pinn 
CCAGACAAAGCAGAGTGGGGAGAACgjCCTTA^ 

CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 

CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGagjCAM 

AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCG^ 

GGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGQTTGGCCGCgATTG 

CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTC^ 

CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGA^ 

CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGT^ 

CCTCCTGCAGACCGCGTCGCGTCAGGCAGAGGTTATCGCCCgGggCCAGACCAACTG 

GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAAC™^ 

cttggcgggcttgtcaacgctgcctggtaaccccgccattgcttcattgatggcttttac-5700 
agctgctgtcaccagcccactaaccactagccaaaccctcctcttcaacata™ 

GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGa^ 
AGCTGGCGCCGCCATCGGCAGTGnGGACTGGGGAjAGGTC^ 
GTATGGCGCGGGCGTGGCG6GAGCTCnGTGGCATTCAAGATCATGAGCGg 
CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCC6GAGCCCTCGTAGT-6000 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACCTTGGa 
GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTA 
C6TGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATAgCAGC^ 
CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 
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ttcctggctaagggacatctgggactggataYgcg^ 
gctaaaagctaagctcatgecacagctgcctgggatcccctttgtgtcctgccagcgcgg 

GTATAAGG6GGTCTGGCGAGTGW 

GATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTGGGTCCTAGGACCTGC^ 
CATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGT^ ccnn 
TGCGCCGAACTACAGGTTGGGGCTAJGGAGGGTGTCTGCAGAGGAATATGTG^ 
GCAGGTGGGGGACTTCGACTftGGTGACGGGTATGACTACTGACAATCTCAAATG^ 



GCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATA 

CCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGAW cnnn 

GCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGnGGC^ 

ACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAAC 

TTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAA^ 

GCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTG^ 

CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGC^ 

CCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGnTGGGCGCGGCCG^ 

CCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTG 

TCCGaTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGA^ 

CCTCACTGAATCAACCCTATCTACTGCCTTGGCCGA6CTCGCCACCAGAAGCTTTGGCAG 

CTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTC 

TGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCa 

GCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAA^ 

GGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCG^ 

CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGT^ 

CAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGA^ 

TGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGG^ 

GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 

ACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGA^ 

GGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACA^ 

AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGA AGGGGGGTCG 

TAAGCCAGCTCGTCTCATCGTGnCCCCGATCTGGGCGTGCGCGTGTGC^ 

TnGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTA^ 

ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCC 

AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGC^ 

TACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCA^ 

GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTG-8a00 

CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTA^ 

TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 

CGTGTGTGGCGACGACTTAGTCGnATCTGTGAAAGCGCGGGGGTCW 

GAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCC^ 

ACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGT^ 

CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTC^ 

AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGG^ 

GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATnCTTTAGCGTCCTTAT 

AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAW Qmn 
AGAACCACTGGATCTACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATnjCACT-9000 

CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 

g 

ACCGCCCTTGCGAGCnGGAGACACCGGGCCCGGAGCGTCCGCGCTAG^ 
AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGG^ 
CAAACTCACTCCAATAGCGGCCGCTGGCCAGCTGGA^ Q _ nn 
CTACAGCGGGGGAGACAnTATCACAGCGTGTCTCATGCCCGGCCCCGCT 
TTGCCTACTCCTGCTTGCTGCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTT 

GGGGTAAACACTCCGGCCT 3 ' te rm i nus 

Some clonal heterogeneities producing amino acid 
substitutions are shown. There are many other 
"silent mutations (not shown). 
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R T 

MSTNPKPQKKNKRNTNRRPQDVKFPGGGQIVG6VYLLPRRGPRLGVRATR 

KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 

RGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARA 

LAHGVRVLEDGVNYATGNLPGCSFSIFLLALLSCLTVPASAYQVRNSTGL-200 
YHVTNDCPNSSIVYEAADAILHTPGCVPGVREGNASRCWVAMTPTVATRD 
GKLPATQLRRH I DLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-300 

V 

TQGCNCS I YPGH I TGHRMAWDMMMNWSPTTALVMAQLLR I PQA I LDM I AG 

AHWGVLAGIAYFSMVGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFV-^OO 

SLLAPGAKQNVQLINTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS 

GCPERLASCRPLTDFDQGWGP I SYANGSGPDQRPYCWHYPPKPCG I VPAK-500 

SVCGPVYCFTPSPVVVGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 

GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

PWLTPRCLVDYPYRLWHYPCT INYTIFKI RMYVGGVEHRLEAACNWTRGE _ 
RCDLEDRDRSELSPLLLTTTQWQVLPCSFTTLPALSTGLIHLHQNIVDVQ-700 
YLYGVGSSIASWAIKWEYVVLLFLLLADARVCSCLWMMLLISQAEAALEN 
LVILNAASLAGTHGLVSFLVFFCFAWYLKGKWVPGAVYTFYGMWPLLLLL-800 

(N) 

LALPQRAYALDTEVAASCGGVVLVGLMALTLSPYYKRYISWCLWWLQYFL 
TRVEAQLHVWIPPLNVRGGRDAVILLMCAVHPTLVFDITKLLLAyFGPLW-900 
ILQASLLKVPYFVRVQGLLRFCALARKMIGGHYVQMVI IKLGALTGTYVY , nnn 
NHLTPLRDWAHNGLRDLAVAVEPVVFSQMETKLITWGADTAACGDIINGL-1000 
PVSARRGREILLGPADGMVSKGWRLLAPITAYAQQTRGLLGCI ITSLTGR 
DKNQVEGEVQ I VSTAAQTF LATC I NGVCWTVYHGAGTRT I ASPKGPV I QM- 1 100 

S T 

YTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHADV I PVRRR6DSR6 n 

SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 

LETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

VLVLNPSVAATLGFGAYMSKAHG IDPNI RTGVRT I TTGSP I TYSTYGKFL- 1300 

ADGGCSGGAYD 1 1 1 CDECHSTD ATS I LG 1 6TVLDQAETAGARLVVLATAT 
PPGSVTVPHPNIEEVALSTTGEIPFYGKAIPLEVIKGGRHLIFCHSKKKC-1400 
DELAAKLVALG I N AVAYYRGLDVSV I PTSGDVWVATD A LMTGYTGDFDS 

Y (S) 
VIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRTQRRGRTGRGKPGIYR-1500 
FVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNTPGLPV cnn 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATyCARAQAP-1600 
PPSWDQMWKCL I RLKPTLHGPTPLLYRLGAVQNEITLTHPVTKY MTCMS 
ADLEVVTSTWVLVGGVLAALAAYCLSTGCVVIVGRVVLSGKPAI IPDREV-1700 
LYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAy , onn 
QTNWQKLETFWAKHMWNF I SG I QYLAGLSTLPGNP A ASLMAFTA AVTSP- 1 800 
LTTSQTLLFN I LGGWVA AQLA APGAATAFVGAGLAGAA I GSVGLGKVL I D 
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(G) 

ILAGYGAGVAGALVAFKIMSGEVPSTEDLVNLLPAILSPGALVVGVVCAA-1900 

(HO 

I LRRHVGPGEGAVQWMNRL I AFASRGNHVSPTHYVPESDAAARVTA I LSS 
LTVTQLLRRLHQWISSECTTPCSGSWLRDIWDWICEVLSDFKTWLKAKLM-2000 

(V) 

PQLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 
TCRNMWSGTFPI NAYTT6PCTPLPAPNYTFALWRVSAEEYVEI RQVGDFH-2100 
YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 
LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 
SSASQLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV n 
VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

(S) 

WKKPDYEPPVVHGCPLPPPKSPPVPPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSSSTSG I TGDNTTTSSEPAPSGCPPDSD AESYSSMPPLEGEPGDPDL-2100 

SDGSWSTVSSEANAEDVVCCSMSYSWTGALVTPCAAEEQKLPINALSNSL 

LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDYLKEVKAAASKVKA-2500 

NLLSVEEACSLTPPHS AKSKFGYG AKDVRCHARKAVTH I NSVWKDLLEDN 

VTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVT-2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SD I RTEEA I YQCCDLDPQARVA I KSLTERLYVGGPLTNSRGENCGYRRCR-2700 

ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLVVICESAGVQ 

EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDGAGKR-2800 

VYYLTRDPTTPLARAAWETARHTFVNSWLGNIIMFAPTLWARMILMTHFF 

SVL I ARDQLEQALDCE I YGACYS I EPLDLPP 1 1 QRLHGLSAFSLHSYSPG-2900 

G 

EINRVAACLRKLGVPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 

(P) 

RTKLKLTPIAAAGQLDLSGWFTAGYSGGDIYHSVSHARPRWIWFCLLLLA-3000 
AGVGIYLLPNRO-3011 

Stop codon 



( ) = Heterogeneity due possibly 
to 5' or 3' terminal cloning 
artefact. 
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